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DISTRIBUTION OF THE RATIO OF THE MEAN SQUARE SUCCESSIVE 
DIFFERENCE TO THE VARIANCE 


By JoHN von NEUMANN 


Institute for Advanced Study’ 


1. Introduction. Let z,,--- , z, be variables representing n successive ob- 
servations in a population which obeys a distribution law 


~(e—t)2/202 1 
ce (x—§) */20 dx, (« = — an); 
oV2n 


i.e. which is normal, with the mean é and the standard deviation o. For the 
sample we define as usual the mean, 


the variance, 


and also the mean square successive difference 


n—I1 


sais 2 : i au (ay41 = Ty)’. 

The reasons for the study of the distribution of the mean square successive 
difference 6°, in itself as well as in its relationship to the variance s’, have been 
set forth in a previous publication’, to which the reader is referred. The distribu- 
tion of 6°, and in particular its moments, were also studied there. The present 
paper is devoted to the investigation of the ratio 


A comparison of the observed value of 7 with that distribution is particularly 
suited as a basis of the judgment whether the observations 7, ---, Z, are 
independent or whether a trend exists. (Cf. sections 1 and 2, loc. cit.”) 

The moments of 7 have already been determined by J. D. Williams by a 


1 Also Scientific Advisory Committee of the Ballistic Research Laboratory, Aberdeen 
Proving Ground. 


2 John von Neumann, R. H. Kent, H. R. Bellinson, B. I. Hart, ‘“The mean square suc- 
cessive difference,’’ Annals of Math. Stat., Vol. 12 (1941), pp. 153-162. 
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different method.’ Williams’ results have been checked by W. J. Dixon at the 
suggestion of 8S. S. Wilks, whose stimulating interest has been largely responsible 


for the undertaking of the series of papers on 6 and r. The present rather 
3 


exhaustive discussion, however, brings out several other essential characteristics 
of this statistic, and provides the key to some very effective computational 
methods. It is further hoped that the reader will find that the mathematical 
methods used and the generalizations indicated have an interest of their own. 
From the latter point of view the final results of sections 5 and 7, concerning 
the distribution of values of quadratic and of Hermitian forms, may deserve 
special attention. 


















2. Diagonalization of the quadratic forms and replacement by a spherical 
mean. Since 5 and s are unchanged when we replace each x, by x, — &, we 
may assume § = 0. Then the distribution law of z is 


x2/2o2 ] 


n 
: —z2/2¢2 
ce dx, and that of 21, ---, 2, is [] ce *#’*” dz,, 
=] 





dz, --+ didn. 














Any linear orthogonal transformation of the 7, ---, 2, leaves > x and 
aml 
dz, --- dz, unchanged, hence the above distribution law will likewise be left 


rm . . 2 2 ° 
unchanged. Thus, we may subject the two quadratic forms 6, s° to any simul- 
taneous linear, orthogonal transformation. 


° . . , / . . 
Consider one carrying 2, --- , 2, into, say 2, ---, Z, , which brings the 
n 
2. ‘ 12 a 
quadratic form (n — 1)é into the diagonal form, say A,«,. Such a trans- 


p=1 
i — . 4 
formation does not affect the characteristic values of the quadratic forms, and 


n 
as ° . 12 
these characteristic values are obviously A,,---, A, in the case of >> A,z,’. 
p=1 
Consequently A;, --- , A, are the characteristic values of the original quadratic 
2 Ty . . . 
form (n — 1)6. Weshall determine them as such in the next section. 


Clearly we always have (n — 1)& = 0, hence all A, = 0. Some A, may 









3 J. D. Williams, ‘‘Moments of the ratio of the mean square successive difference to the 
mean square difference in samples from a normal universe,’’ Annals of Math. Stat., Vol. 12 
(1941), pp. 239-241. Cf. also L. C. Young, “‘On randomness in orded sequences,”’ Annals 
of Math. Stat., Vol. 12 (1941), pp. 293-300. 

4For the properties of matrices and quadratic forms cf. e.g.: J. H. M. Wedderburn, 
Lectures on Matrices, Amer. Math. Soc. Colloquium Publications, Vol. 17, New York, 1934. 
In the present context cf. mainly Chapters II and VI. 
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equal 0 say k (= 0, 1, ---, n) of them, which we can arrange to be Anz, 
Te 
(n — 1)s = 0 is thus equivalent to mse =e ,= 0, i.e. to n — k inde- 
pendent conditions. On the other hand this amounts obviously to 7; = --- = 


z,, and these are n — 1 independent conditions. So k = 1 and consequently 


Ai,:*:, Ana > 0, A, = 0. And our linear orthogonal transformation must 
. . . / 
carry the z-vectors with 2, = --- = x, into the 2’-vectors with 2, = 
’ 1 1 
= 2,-1.1 = 0. Among the former, Va’ KOR rf has the length 1; among 
n n 


the latter only 0,---, 0, + 1 have. Hence these correspond to each other. 
Now the scalar (inner) product of two vectors is an orthogonal invariant, that 


: 1 . s —_ , ’ 
of a vector 41, --+ , , With —~, --- , ~~ is ~/né, that of a vector 271, --- , 2, 
n Vn 

° . , 
with 0, --- , 0, +l is +2, , hence 


/ 


/né = +2. 
Putz, = + u,. Thenclearly >> u, = 0. Hence 
p=! 


2 12 2 
+ Zz. Up = Xn + ns’. 


u=1 


9 


NX 


Owing to the orthogonality, the left-hand side is equal to >> z, therefcre 


p=1 
n—1 


2 12 
ns’ = >> x,’ 
p=1 
Remembering that A, = 0, we also have 


n—1 


(n — 1)8 = a A,yt,- 
~_ 


Consequently 
n—1 

9 

i. 

n up 


a=-i 


u=1 


p= 


wat . , i a‘ ‘ ’ ’ i 
The distribution law is, as we know, the same in 2) , ---+ , 2%, ASIN 2, +++ tn, 
namely 


/ / 
dx; --- dz,. 


. / / 
Thus xz}, ---, 2, are independent. 1 depends on 2), ---, 2,-1 only, hence we 
. / . . . / / 
may disregard z,, altogether, and use the distribution law of the 21, --- , Zn, 


—1 
—"D 212/202 , , 
= = 
ce dx, +--+ d%a-1. 
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ye , , , , a 
With respect to 71, --- , Y,-1 We may now state that the 71, --- , ©,_; dis- 
tribution of 7 can be obtained by determining first the distribution of 7 over 
every spherical surface 


n—l 
Sagas 

Lt =7 
u=1 


and then averaging these distributions with the weights ¥(r) dr, where y(r) dr 
is the probability of the spherical shell from r to r + dr with respect to our 
original x, , --- , x,-1 distribution law. (It happens to be c’e”?”*r" dr, but 
this is immaterial.) 

Since the 21, --- , 24-1 distribution law is obviously spherically symmetric 
in these variables, the first-mentioned distributions over the spherical surfaces 
are readily obtained by assigning each piece of the surfaces in question its own 
relative, n — 2-dimensional area as weight. 

Since 7 is a homogeneous function of Xi Sia x, _, of order zero, these spherical 
surface distributions of » are the same for all r. Consequently we can replace 
all these r by, say r = 1, and the subsequent averaging over the r may be omitted 
altogether. 

Finally, since we restrict ourselves to r = 1, i.e. to the spherical surface 


the denominator of 7 may be omitted and we have 


n—1 
n 49 
= ——_ As. . 
n=" Aa, 


==] 


We sum up, writing again 7, --- , tn, for 21, «+: , 2n-1, then the desired 
distribution of 7 is that of 


n 2 
a, oe 
’ n—1% ee 


where the point 21, --- , Y,-1 is uniformly distributed over the spherical surface 


Here Ai, --- , An-: are all positive, and together with 0 they are the charac- 
teristic values of the quadratic form 


a ee 


u=1 


n—1 n—1 
2 2 2 
tit 2 Doaet 2, — 2 Do tp%ei- 


p=2 p=1 
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3. The characteristic values A, ; first orientation concerning 7. We have 
shown that there exist (counting multiplicities) precisely n — 1 positive roots 
A of the characteristic equation 


A-1 1 
1 A-2 1 
1 A-2 1 


A-2 1 | 
1 A-1\| 


(the empty places are filled with zeros), and that these roots are the 
Ai,°°° ail 


Such an A is characterized by the possibility of solving the equations 
(A-—l)a+a=0, 1+ (A — 2)r+ 43 = 0, t+ (A — 2)23 + % = 0, 
Yn-2 + (A — 2)tn1 + 4n = 0, tai t+ (A — 1)z, = 0, 
- , a, not all equal to zero. Put 


to = 71, Inti = In; 


A = 2 — 2cosa, 
then these equations become 
Ly-1 + Muu = 2 cos a-zy, for p=1,2,---,n—I1,n. 
The last equation is satisfied by 
ty = 2 cos (u — 3)a for wp=0,1,2,---,n—l1nnt+l. 


Now zo = 2; is automatically fulfilled, while 7,4: = z, demands cos (n + })a = 
cos (n — 3)a. This is certainly the case when (n + 3)a = 2kx — (n — })a 


(k any integer), i.e. a = ~ For nok = 1,---,n—larem,-:-:- , in all equal 
n 


to zero (indeca x, = 2 cos = > 0) , hence these k give A’s of the desired kind. 
They are 


Aud-teet! otal (k=1,---,n—1), 
n 2n 


and so they are all positive and different from each other. Their number is 
n—1. Hence they are precisely Ai, --- , An. 
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So we have shown 





A, = 2—2cos™ =4sin?™ (u=1,---,n—1), 


n 2n 





We can now reformulate the final result of the preceding section. 
Let us set 








2n 
= 1 —e). 
sma" ‘) 


Then 


n—1 
-_ >» er 2 
€e= cOS —-d,, 
p=1 nr 















where the point 21, «++ , 2,1 is uniformly distributed over the spherical surface 


| 


n—l 


uw 





Replacement of xz, by z,_, carries e into —e. Therefore the distribution of 
€ is symmetric around 0. Hence the mean of ¢ is 0. The maximum of ¢’s 
a — e . aa ‘ (n — 1) 7 : 
distribution is clearly cos — , its minimum is cos ) = —cos—. Westate 
n n n 
these facts, together with their equivalents for 7. 


a ee ee ; ; a 2n ‘ 
e (m)’s distribution is symmetric around its mean, which is 0 ( . The 
n— 


. ; ee ae T 2n T 4n oT 
maximum of ¢ (7)’s distribution is cos — { — 1+ cos— |= cos —}, 
n\n-1 n n—1 2n 


: nis ; 7 2n T An of 
its minimum is —cos — 1 — ecos—]| = sin’ —}. 
n\n -1 n on | 2n 


Thus it will be easier to obtain information concerning 7 by considering the 
distribution of €, since all odd moments of ¢€ are zero, etc. The investigation of 
e instead of » was first suggested by B. I. Hart, who also found, that the first 
four odd moments of € vanish. R.H. Kent and B. I. Hart also determined the 
minima and maxima of these distributions for certain small values of n. 










4. Direct computation of the moments. 
law of a quantity 


We shall investigate the distribution 


m 


nr 9 
y= Bur. 
p=1 
where the point 2; , --- , 2m is equidistributed over the spherical surface 


m 
Z x = 1. 
p=1 


(ou above ¢ obtains by putting m = n — 1 and B, = cos — : 
n 
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We denote the mean of any function 
f(a, Rey » Su) 
over the above-mentioned spherical surface (the 2, --- , Xm being equidistrib- 
uted over it) by 
* » Mak 
Our primary objective is to determine the moments of this distribution 


alles “fm \P 
M,=y7 = (> By) (p = 0, 1, 2, 


p=1 


~ 


Let us write ~,, for the (m — 1-dimensional) area of the above-mentioned 
spherical surface (of the unit sphere in m-dimensional Euclidean space). 
Now we form the function 


oo x = m 2 
2 _— ee. > 
f(z) = | ete | @ s=1 47h © wi *. day eee a 


oo — oO 


(This integral, as well as all others which we are going to derive from it, is ob- 
viously convergent, as long as z is sufficiently small. More precisely this is true 
when 


| z|-Max (| Bil, ---, | Bm |) 


We shall use them only in the neighborhood of z = 0.) Now clearly 


co 0 m Pp m 2 m 
2 z > Buzz —™ 22 

[ -“ @ | (> Bz. @ uw=1 “i Fe p=l *dz —— 2 dim 

— —co \u=l z=0 

fo) oo m Pp m 9 

> 2 -> 

| Poy [ Zz But, e v= * day eee AXm 

— OD —=e uw=1 


oo oo m Pp m ‘ 
2 -™” 
[ patie [ M, ( Ly @€ u=1 7B dx, —e dim 
— 00 — 00 u=1 / 


@ 
2p reg m—1 ” 
[ ar ¢ x6) 6S 
0 


eo 
. —r2 2p+m—1 5 
2m u, [ é : r? = dr ; 
0 
@ 


—u ptim-1 
4=,.M,] e*uPt" du 
0 


5 Introduce the new integration variable u = r?. 
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On the other hand 
f(z) =_ [ ies [ oh (1— Bu2)75 dy, -++ dm 






el Buedep dz, 6 


[401 — Beye [ 





wa =martargy 














T(3)" Ble), 


where 


Be) = I] a - B,2). 


42M T(p + m) = : pre zy” = 


For p = 0 this becomes, since My) = 1, $(0) = 1, 


1 m 7. 
~ Sta) = Fos . 
2"(5) =") 
Dividing the former equation by the latter gives, since 
m 
(+5) _ m(m ei). fs»-a). 
r *) , . 
2 


M, = {a Be rt 


5G) GH)" 


In order to make a practical use of the above formula, we compute 


Thus 





In (B(z)*) = 41> In (1 — B,2) 
2, 






6 Introduce the new integration variable u = (1 — By,z)r?. 
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1< 
a= 5 2, By, 


p=1 


@12+a922+a323+... 


= € 


1 + Biz + Boz” + Bz + +s, 


1-2---p 


m m 
(@41)---(t4p-1) 


By Q1, 
Bo = ae + 3a, 

3 
Bs a3 + aa, + gQy ’ 


2 2 4 
Bs = ag + S05 + aia3 + FajQ2 + pray . 





Bp. 


Clearly 


In our application (cf. above) 
Bau, = —B,. 
This has the consequence that 
= a;,=—=a,=:-- = 0. 
Thus the z functions we compute contain only even powers of z and consequently 
Bi = Bs = Bs = -::> 
M, = Mz = Ms=-:-:- 


= o, 
a 1.2 
— i + go ’ 


3 
= ag + ara, + $2 , 


2 2 4 
= & + 5Q4 + A206 + Faas + Brae . 
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° Tv 
As mentioned before, we actually have m = n — 1 and B, = cos =. Ce 
n 


sequently 


l ‘nad 
‘7 ger 4. —iprl|n l 
a\ oy (Hr + 


l 
> p l fer 7 
21] St = \k 


1 » : > geeG—atdin 
) 


p=1 


ee > @ b> eitmek-BD In i} 
2'+1] 720 \k L . 


The inner sum has obviously these values 


n—1 _ =n if k — $l is divisible by n 
bs etre e—ttyin Y 


u=0 0 otherwise. 


z (i) - 


Consequently 


n rfl 1 
= gray 2 (;,) ~ 9? 


where >,’ extends over those k = 0, ---, 1, for which k — 41 is divisible by n. 
k 


Let us now determine the & occurring in the following sum (as above, k — 4 
is divisible by n) 0’. k = 4lis clearly one of them. All others are of the form 
k 


k = 31 + hn, h = 1, 2,---. The term contributed is the same for + and 


for —, since 
ls )- li : ) 
yl + hn 41 — hn} ~ 


= 0, for l odd, 
ay 

_I1jn l \ . 

~ = ‘ (4,) a § )| IP for 1 even. 


7 As pointed out above, we need to consider only the even I. 


So we have 
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The number of terms which the sum > ,contributes depends on the comparative 
h=1,2,- e 
sizes of land n. The number is clearly 
0 for al <n, 


lfor n S 31 < 2n, 


(0 for n = 1), 


1 


= 1, 25 3a 


35n — 128 
* a (0 for n 
Bs = Bs = Br = By = ::: 
n—-2 
8 ’ 


n> + 2n — 12 
- = (0 for n = 1, 2), 
n® + 12n” + 8n — 168 5 ) 
3079 : (0 for n 1, 2; 1024’ n=3), 


(0 for n = 1), 





_ + 28n! + 212n* — 64n — 3696 (9, 
98304 


1, 2; 
35 35 
annem = > = 4 é 
32768’ " — 3) 2048°” ) 
M, = M; = M; = M; = My, = coe = 0, 


— —~  na-2 
m—1DntHD™ Mm—NHm+)d’ 


M, = (0 for n = 1), 


8 The author wishes to express his thanks to Miss B. I. Hart for her kind help in carrying 
out these computations. 
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384 1B, = 3(n* + 2n — 12) 
Din+iI(n+3)\(r+5)° HM —DYDmt+)@t+3)(n +5)’ 


(0 for n = 1, 2), 





46080 B 

Din + I(n+3)\(n+ 5)(n+7)(n+9) 
. 15(n° + 12n? + 8n = 168) —_ 

Din + 1(n+3)n+5m+7)(n4+9)’ 


(0 for n = 1, 2; resz, n = 8). 








10321920 


(n—1(n+ )D(int+ 3)(n+ 5 (n+ 7(n+ 9(n+ 1D(n+ 13) ° 
105(n* + 28n* + 212n° — 64n — 3696) 


~ (n—-Nnt)M+F3 n+ 5n+7D(n +9 (m+ 1m + 18)’ 
(O for n = 1, 2; sx%7e5, n = 3; ate7o, nm = A). 


We conclude this section by obtaining asymptotic formulae for the distribu- 
tion of e whenn > ~. 

In this case our formulae show that all a; (1 even) behave asymptotically like 
constant multiples of n. It also appears from our formulae for the 6; (J even), 
that 


a. 


B= api? 


~ 


4s polynomial in a2, a4, «++ , a2 of total order <= 4/1 — 1. 


1 yy. ‘ ; 
Consequently GD! a; is the dominant term in this expression, and so we have 
2 . 


asymptotically 


Bb ~ el i =: 
GD? GD1\8/ ° 


From this 


2 


l! i f1\® 
hn ho ee 
ft (y" ami (as) 


Now the normal distribution 


on 2/207 es 1 
ce” '! dy, (« = = 
G1 / 2m 


with the mean 0 and the standard deviation co, has the moments 


mi -/ y' cre Yat dy. 
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This is clearly 0 for 1 odd, while for | even’ 


2 
t+1 +1 = l-—1 
ee a2 | ot ak dy 
0 


= Qh) git ep (2% 
9 : 
For 1 = 0 this becomes, since m) = 1, 
l= BoieT (4). 


Dividing the former equation by the latter gives, since 


r(‘tt 


l! ( 
(AD1\2/) ° 


2 
Comparing the formulae for 17; and for m; shows that M; ~ mz if - = 


o 
Q’ 
Vi 
1 = —. So we see: 
n 
For n — © the distribution of « becomes asymptotically normal, with the 
1 


mean 0 and the standard deviation o, = 2 . (The same result could be ob- 
n 


tained by applying the general theorems of Liapounoff and others.) 


5. The distribution law, general discussion. We return to the quantity vy, 
defined at the beginning of the preceding section, of which our ¢ is a special case. 
We wish to obtain direct information concerning the distribution law of this y. 

Since a permutation of the B, is permissible, we arrange them such that 


B,2B,2---2 Ba. 


(In the special case y = e, the B, = cos Hm are given in this arrangement.) 
n 
The distribution of y covers obviously the interval 
B, é y z Ba . 


And if not B; = --- = Bn, ie. if B, > Bm, which we assume to be the case, 
then we have obviously a continuous distribution law for y in this interval. 
We denote it by w(y) dy. 


2 
* Introduce the new integration variable u = a. : 
me 
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Assume for the moment that B,, > 0. Then the quantity 


ym = - (5 Bx ." 


is bounded, and we can therefore form its mean value. This is the "- moment 


of y (cf. the beginning of the preceding section) 
Min =" = -(3 B tt) Z 
= [. y*" w(y) dy. 
With any twoa > b > 0 (we shall have = 00 subsequently ) form the 


quantity 
—}m 
t(a, b) = 7 | (> 21) da, +++ dtm” 


. z2>b2 
*e 1 m= 


“dr 

- —1 
[rn dr = 2m [ ae 
b a 


Consider next 


2 ll 
s(a, b) vee 23) dz, +++ dtm 


4m nn 
I] B, dx -++ dam 


onl 


M_im / [I B, t(a, b). 


u=1 


10 Concerning this transformation to polar coordinates and the quantity =» ef. the first 
part of the preceding section. 


1 Replace each variable z, by ~/ Butz. 
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On the other hand, a comparison of their respective integration domains makes 
it clear that 


t((B,a, Byb) S s(a, b) S t(Bia, Bb). 
Thus 


Bya 
> ae | ” " eo 
z ny, sz me a 


= 31 


1 5+ in 


m a 
i B In , 


p=l 
i a 
Now let — co, then 
1 
II B, 
p=1 
obtains, i.e. 


—}m =f y tm (y) dy = 


Vie. 


We now drop the assumption B,, > 0. We consider instead a real number 
z with z < B,,. Replace each B, by B, — z. Then the one with u = m be- 
comes > 0. And y is obviously replaced by y — z. Consequently our above 
equation is now valid in the form 


(y - 2)" = [ (y — z) wy) dy = a 5 
7 Ihe.» 


Let now z be a complex variable. The second term of the above equation is 
a (locally) analytical function of z, except in the (real) interval B; = z = Bn 
The third term, too, is a (locally) analytical function of z, except at the (real) 
points B,,---, Bm. Consequently both are one-valued analytical functions 
of z in the simply connected domain which obtains from the complex z plane by 
exclusion of the (real) half line 

Ba: 

Hence the equation 


(1) [Ow - 20) dy = —4 
. II (B, saa 


p=1 
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which holds for all (real) z < B,» , remains true for all complex z of the above 
domain.” 

We observe next that w(y) is an analytical function of y in B; 2 y 2 B,,, 
whenever y ~ B,,---, Bn. This is easily established by using any multiple 
integral expression for w(y) which, while hard to evaluate explicitly, puts this 
analyticity into evidence.” 


1 
12 (y — z)~ and the factors (By — z)~ of —— >> are those branches of these 


I (3, — 


analytical functions which are (real and) > 0 w eas is (real and) < B,. When m is even 
(as it will be, ef. below) the domain of analyticity is somewhat more eteniied. but we need 
not discuss this. 

13 The computation which follows gives the desired analyticity in a simple way, and also 
makes it clear why the analyticity fails aty = Bi,---,Bn. 

Consider the y ¥ Bi ,--- , Bn in B, 2 y 2 B, . The probability of y S y is p(y) = 


y 
/ w(y) dy, and we may establish its analyticity instead of that of p’(y) = w(y). 
B 


m 


m ™ 
Obviously p(y) is equally the probability of p Bx sy a a2, if the 21, +--+, tm are 


w= u=1 
m 


equidistributed over a spherical surface a zz = r’, with any given r > 0. 
p=1 
Our hypotheses concerning y imply By > y > By,: for a suitable v = 1, --- ,m — 1. 
Consider now the expression 


age 
fy = e u=1 " dx, +--+ dtm. 


m 9 
2< pe 

> & utp Sy. S = 

up=1 =1 


Transforming to polar coordinates, we obtain 


fly) - | eT - Sm p(y)r™—! dr 
0 


@ 
= zm | eT r™—! dr- ply). 
0 


(=m as before.) Hence it suffices to establish the analyticity of f(y). Now on the other 
hand z 


5 <2 
S(y) fx } é veh dz, eee dxzm 


v 
, > 2 
2, (By ~y)z, S-% (y—By) zz 


a+1 


1 , § 4 eed "F178 ay. date 


Tie. - S we< 
uff * BS, 2a, wh 


u 


(We introduced the new variables w, = V\ By — y | zy.) And this expression is clearly 
analytical in y, since By > y > Bysi. 
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We shall need only the fact that w(y) possesses 3m continuous derivatives at 
these places. (m will be assumed to be even, cf. below.) Its behavior at 
y = B,,--+, Bm willfollow from our subsequent results in all cases where we 
need it. 

In order to determine w(y) from (1), as we now propose to do, it is very con- 
venient to assume that m is even. We therefore make this assumption, and 
shall maintain it throughout most of what follows. 

Consider a yy ¥ Bi,---, Bmin By = yo = Bn. Then B, > y > By, for 
a suitable v = 1,---,m-—1. Now put 












se 






z2= y+ it (t real and > 0), 


form (1), take the imaginary parts of both sides, and let t — 0. 
Consider first the left-hand side of (1). Since w(y) possesses 3m continuous 
derivatives at y = yo, we have 







}m—1 


uly) = > Oly — wl’ + ey — wo” 


k= 








with a bounded e(y). 





Clearly 


1 {d' 
a, = kl {25 ow} 


Thus, since w(y) is real, all 6; are real and e(y) is also real. 
Compute now the contribution of each one of the 3m + 1 terms in the above 


expression for w(y) to the imaginary part of the left-hand side of (1). 
The last term, e(y)-(y — yo)”, gives 


By By = jm 
8 — w— Welady — way = gf (—4— 1) a) a 
Bm B, \Y — yo — tt 
The integrand is uniformly bounded, and so the reality conditions cause the 


entire expression to — 0 fort 0. Hence the contribution of this term is zero 
fort > 0. 








m 
‘s 





The other 5 terms correspond to k = 0, 1, --- — 1, the k term being 


her 





By 
8 fy = yo — i -Ouly — yo)*-dy 





By 


(y — Yo)" 
6, qacennneditoaaaaas 
S Bm (Y — Yo — it) 








dy 





nS 





; d 
(y — yo — it)» , 
k 
arly 





= 0; 2 (7) 3{(co" [ (y — yo — it)**™ in}. 


m 
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The exponent k — h — 3 in the integral is always S (= - 1) —-Q- _” 
m 


and it is = —1 if and only if k = ._~ 1, kh = 0. Consider first a term where 
this is not the case, i.e. where the exponent k — h — 3 <-—-1l. Forsuch aterm 


the expression ${---} becomes 


“\h 1 *,;\k—h-—3m = 
X(it)’ —— {(y = = it)* -s on 


kK-h-gt+l 


For t — 0 the last factors are bounded and real, and so the entire expression — 0: 
for h = 0 because of the reality conditions, for h > 0 because of (it)" > 0. 


Thus only the term k = 3 — 1,h = 0 can contribute something else than zero 


for t — 0. 
Now this term is equal to 


Aim {In (y — yo — tt) }o=a1,, 


and for t — 0 this converges to 


1 
+r fa 1 


(@ — 1): =o} 
2 


OimaS(ir) = 1Oima = 


Thus the imaginary part of the entire left-hand side of (1) converges for 
t — 0 to this expression. 

The right-hand side of (1) is easier to discuss. The imaginary part under 
consideration is now 


[oa ER 


———— il 
I (B, — yo — tt) 
Considering” (its y is our yo + it), this converges for t > 0 to 


$11, - wr IL, sw - 8)? = gyn 1 
Jina — yo| 


14 This evaluation {In (y — yo — zt) =. — in is based ont > 0, and the fact that y moves 
on the real axis from B,, to B;. It has no connection with”. 
15 The square roots of the (real and) > 0 quantities 


Bu — Yo (u = a, “es, 0), Yo— By (u =v+1, ses, m), and ][!4. - yo| 
a1 


are taken to be > 0. 








or 


ler 


yves 
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If v (hence m — v) is even, then this is zero. If v (hence m — v) is odd, then 


this is equal to (—1)*"-"-? pence . Thus (1) becomes the following 
Il | B, = Yo | 


equation: 


‘ qin = 0 if v is even, 
7 
1) 


3m—1 ts : 
! dy y=V0 = (~grr"** -—epsaepempensine if v 18 odd. 
/ Il | By a Yo | 
pal 


Simplifying this, and writing y for yo , and also restating the definition of v gives 



















gin = 0 if v is even, 
dyn w(y) (2 : ) | 
(2) = (—1)» \2 = ; = ff vis oda, 
Uf |B, — y| 
B,>y> Bus,v =1,°+:,m—1. 






Observe finally, that if we put 
A(y) = I] (y — B,), 
aa 


then this product has v factors < 0 (u = 1, --- , v), while the others are > 0. 
So 





9 > even 
W(y) 4 0 for v odd ’ 





and in the latter case 
I] |B. — y| = —QAy). 
a 


It is clear how we may now rewrite (2). 
We are now in a position to determine the behavior of w(y) aty = B,,---,Bm 


too, since we know how its —- 1-th derivative behaves in the immediate 


vicinity of these places. (2) shows that it is singular there, and that the nature 
of the singularity depends on the number of the u, for which B, is equal to the 
y in question, i.e. on the multiplicity of this root of our polynomial Y(y). 

In our actual application (to y = e¢, ef. the beginning of this section) the 
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B, are pairwise different, i.e. all root multiplicities of 2(y) are equal to one. A 
further special case, which has a certain interest of its own, is when the B, are 
equal two by two, but otherwise different, i.e. all root multiplicities of 2(y) are 
equal to two. In the discussion which follows we shall therefore assume that 
one or the other of these two cases occurs. 


qe 1 : 
In the first case dy w(y) has on each side of a y = B, one of these two 


behaviors: It is identically zero, or it is singular, of the type Thus 


a tae 
VIB. — yl 


w(y) is continuous on each 


iia ; eo 

it is at any rate integrable. Consequently — 

0 y m—< 

side of y = B,, i.e. for both y = B, + 0. Successive integrations give now 
dé 


that all — pO), k= 0,1, 45 — 2, are continuous for both y = B, + 0. 
= B> B= B,>---> Bai= Ba. So 


4m—1 


l 
the v with B, > y > B,4, is necessarily even, and w_ i ; »(y) is identically zero 


In ue dial case we have B, 


4m—2 


for all of (2). Consequently Re ~a7» (y) is again continuous on each side of 


y = B,, i.e. for both y = B, + 0. Successive integrations show again that all 
k 


iow) k= 060, 1,---, 5 — 2, are continuous for both y = B, + 0. 


k 
We must therefore discuss only how much the i , oy), k = 0,1, 


change from y = B, — Otoy = B, + 0. 
Let us return to the procedure by which we derived (2) from (1). We put 
again 


2=ytit (t real and > 0) 


and let ti — ©. But we consider now (1) itself (and not merely its imaginary 
part), and we choose a yy = B,. 


Consider first the left-hand side of (1), always disregarding terms which stay 


Byta 
bounded fort — 0. Then we can replace the integral . of (1) by any fC 


By 


with any fixed a > 0, and this is’equal to 
By—0 Byta 
L, *1..: 
By—a Byt0 
We choose this a > 0 so small that no B, ¥ B, lies between B, — a and B, + a. 


k 
Le. all ee wy), kK = 0, 1, ~—_— 2, are continuous from B, — a to 


‘.% 
B, — 0 and also from B, + 0 to B, + a. 
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This being the case, we can evaluate the above sum of two integrals by > — 


successive partial integrations. Thus we get 


y=B,—0 
- (G — k)! B 4m+1+k d* 
a cei — B, — it)” a 
= § ~ :) (y it) dyt aly) 


2 y=B,—a 
y=Byt+a 
m—2 (* -2-k)! at | 
a dX Fi sensei (y — B, — i w(y) 
oe a ane ! 
G 1) foal 
1 Byta 4m—1 
+ gf (Y — By — i Fr lu) dy. 
ep 


In the first two lines the y = B, + a terms are bounded for t — 0, therefore 
only the y = B, + 0 terms need be kept. Then the first two lines give 


=~—-2- k) k y=By+0 
gE =8—)! ymin ft sg) 


m Selenite 
_ ! v 
(3 (B-1)1 


up to terms which stay bounded for t > 0. Consider now the third line. We 
4m—1 C2 


know that the . w(y) in its integrand can be majorized by “en 1 


dy — B.] — (for 


a suitable constant ce , cf. our discussion preceding the present one). Thus the 
integral in question is majorized by 


Byt+a 
[ly - Be - itlealy — Belay, 
By—a 


hence a fortiorz by 


[ ly — B, — it|"c|y — B,|*dy* 


cot? [ix - oe 


17 


a fo Vu + 1 = 1)-Ju| 
° dv 

Vet +1 

y — By 
t 

17 Introduce the new integration variable v = V|u|. 


t? 








16 Introduce the new integration variable u 
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Since the last integration is obviously finite, the entire expression is O(t™*) for 


t— 0. 
Consequently the left-hand side of (1) is equal to 


Byt 


wea ( - 2 - )! a 
(gy er“ ‘a ow) + o(t), 


k=0 m 
es oom ! 
: 1) 
k 


«heat pie ) at y = B, + or B, — 0, respec- 


fort— 0. (For B, 


tively, must obviously be taken to be zero 
Consider now the right-hand side of (1). 

We first 0 the B, are pairwise different 
smenppenenenecenmcermmescenccss: iy. Gig). 


tion is 
(/ I «, ee 
et --> > Ba-1 = Ban. So we 


p= 
Secondly let us consider B; = B, > B; = By > 
*) The right-hand side of (1) becomes now 


may assume v = 2A (A = 1 5 


The right-hand side in ques- 


. ‘ i 12 ‘ 
(The sign is determined by.) Soin our case 


a rational function, i, 
ne (Bx — 2) 

a — 
ei (—it) + O(1). 


= 
Mi e. 
Box) 


tia 
I] (Bo, — Bo — tt) Ta (Box — Boy): i, (Ba — 
thane these with our iain aati: gives therefore (for t — 0) 


jm—2 (3 <_ k) weld — 
ae A (un ee 3. ow} 
y y=B,—0 


m 
oan 
2 ) 
O(t) in the first case, 


— ae sl se 
' — (—it)~ + O(t”’) in the second case. 


2 (Bs — By): I. (Ba — Bax) 
t). Hence the O(t”*) 


In this formula the left-hand side is a polynomial in (—7é) 
terms on the right-hand side must vanish, and otherwise all powers of —7t must 
Consequently 


a™ y=B,+0 
diy' owt 


have the same coefficient on both sides 
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must vanish, except in the second case for the one value of k with 


-5 +1+k = -liek = 3 — 2. So, with this one exception, we have 


d* d* 
‘F oO) . ‘a Wh ae. 


And in the exceptional case (second case, v = 2d) 


4m—2 y=B,+0 1 
{a ow} 2 at 
y By? Il (By, — Bo) II (Ba. — Bax) 
bani k=A+1 


k 
Thus in the first case all derivatives = w(y),k =0,1,---, 7 — 2, are con- 
Y 
tinuous even at y = B,,---, Bn. 


In the second case the same is true fork = 0,1, ---, 7 — 3, but the deriva- 
tive with k = 3 — 2 behaves differently for y = Bi, ---, Bm. Indeed, for 


y = Boi = Ba Q =1,---, =) this derivative is continuous for both y = 
B., + 0, but it increases from B,, — 0 to Ba + 0 by 





1 
(~ aii (= 1) la 3m Te 
II (Bx — Ba) [I (Ba — Bu) 
k=1 k=\A+1 


k 
(at y = B, + 0 and B,, — 0 the E w(y) must be thought to continue with the 
value zero.) 


These rules, together with (2), determine w(y) completely. 


6. First special case. We consider the first special case, where the B, are 
pairwise different. We immediately specialize further, toy = ¢,i.e.m = n — 1, 


B, = cos = (u=1,---,n—1). (Cf. the beginning of the preceding section.) 
Since m must be even, nm must be odd. The rules of section 5 determine 


k 
w(y); in particular all derivatives . w(y),k =0,1,---, > 
dy* 2 
where continuous, beginning and ending with zero at y = B, and B,4, 
respectively. 
In the even intervals 


— 2, are every- 


B, = y 2 Bs, Bs z>y = Bs,--+, Bas 2y = Br, 
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3(n—1)—-1 
the derivative gio w(y) is zero, i.e. w(y) is a polynomial of degree 
y n 


4(n — 1) — 2. In the odd intervals 


B,2y 2 B,, Bz = y = By, ---, Bn-s 


IV 


y = B..a,; 





we have 

w(y) = + corps 
V-Uy) 

(the sign + is alternating (—1)*"?, (—1)*"- >, .-- , +), where 


n—1 
A(y) = I] (y — cos . 
p=1 n 


Another expression for %{(y) may be found by the following method. 


"ale (jin—-1}-1)! 1 
T 


dyse—D-al 


: ing —ing n—1 
sin (n ev"? —e Sicilia 
Clearly on iow ft een a 
sin ¢ ce —<~ p=0 
is a polynomial of cos g = 3(e’* + e “*) of degree n — 1, with the highest co- 
° n—1 a] MT . . 
efficient 2° . Forg = —,u = 1,---,n—1,sin (ng) = 0, sing ¥ 0, hence 
n 


sin (ng) 
sin ¢ 
nomial of degree n — 1 with the highest coefficient 1. Consequently 


, as a polynomial in cos ¢, has the same roots as A(y). Wy) is a poly- 


1 sin (ng) 

(eos ¢) = a 4 = 

2”! sin ¢ 
This formula allows one to compute %(y) quickly, examples are 
n=3: Ay =y —- 


n= 5: Aly) 


mo 


3,/2 1 
4Y + ve, 


= 9 g 4 3,,2 
n=7: Ay) =y — sy + 2y — &. 


I 

oS 
rs 

| 


The number of odd intervals, on which integrations must be carried out, 
is $(n — 1), but since those which are symmetric with respect to 0 require the 
same computations, only }(m — 1) or }(n + 1) must be considered. So there are 
1,1, 2, --- such intervals for n = 3, 5,7, --- respectively. The integrals are 
first elementary (aresin), then elliptic, then hyperelliptic. 

Numerical computations for n = 3 are immediate; for n = 5, 7 they have 


been carried out with considerable precision by B. I. Hart. 
$(n—1)—1 
d : ' 1 
Aty = B,, dy w(y) has a singularity of the type vis =B, | (ef. the end 


k 


of section 5), while all Say), k =0,1,---, $(n — 1) — 2, are continuous. 
y 



















)=- 


e 


nd 


Ss. 
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At y = B, and B,_1, in particular, they are zero. 
k 


integrations that the order of vanishing of = w(y),k =0,1,--- 
y 


Hence it follows by successive 


at y = By and By is (nm — 1) — 1)—k-4=5-2-k. 


5 In particular 


for k = 0 we find that at its maximum and at its minimum (2, and B,_1, 


i.e. -& COS _ the order of vanishing of w(y) is 5 — 2.* 


Since w(y) has this property, and since it is obviously an even function of y, 
R. H. Kent has suggested approximating it by a series expansion of the form 


°o e 3n—2+h 
wy) = da (co: —-_— i’) ’ 
h=0 n 


Computations by B. I. Hart, not yet published, have shown that even the use 
of the first four terms (h = 0, 1, 2, 3, the a, being determined by the condition 
of normalization and by the first three even moments of the actual distribution 
given in section 4) give excellent approximations. The use of the formula (3) 
suggests itself likewise for even values of n. 


(3) 










7. Second special case. We consider now the second special case, where 
B, = B, > Bs = By >--- > Bn = Bn. This has no immediate bearing on 
our original problem (cf. the preceding section), but we shall nevertheless discuss 
it for the two following reasons. First, it is hoped that the reader will find an in- 
dependent interest in the simple and complete results which can be obtained in 
this case. Second, there are various modifications of our original problem, which 
lead to this case. For example let the x,, --- , x, in our original problem, as 
described in section 1, be complex numbers instead of real ones, replacing all 
squares by absolute value squares. Then one verifies easily that all character- 
istic values \; , --+ , An—1 are doubled, and so our first case goes over into our 
second case. (This amounts to replacing our quadratic forms by Hermitian 
forms, cf.*) It is easy to imagine two-dimensional problems where this set-up 
is natural. 



















™ , 90 that C, > C2 > ++ > Cin 


We put C, = Ba, = By forrA = 1,---, z :, 


are the only restrictions imposed. 


“very yin B, = y 2 By, , i.e. in C; 2 y 2 Cym, lies in an interval C, = y 2 
3m—1 
That is the v of (2) is always even, and so aren w(y) 
Y 


Therefore w(y) is a polynomial of degree 


Chas 1.e. Ba = y 


IV 


Boyz1 « 


is zero in every one of these intervals. 


m . , 
= — 2 in every one of these intervals. 


9 We have already shown that w(y) is 


18 We omit the simple discussion of n = 3, which must be excluded from this result. 
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not the same polynomial in each interval. Thus w(y) is represented by .* 1 


; m ' m , 
polynomials of degree — — 2 in the — — 1 intervals 


C2y2C., C,.2y Oe, +> eee BOS Con. 


We could try to obtain explicit expressions for these polynomials by a direct 
application of the results at the close of section 5. A characterization of the 
distribution can, however, be obtained in a more elegant way by an indirect 
procedure. 

Consider an arbitrary function §(y). We wish to express its mean 

Ci 


By) = | Boy) dy. 


Cim 
If we can do this for all §(y) then the distribution is completely characterized. 
We select first an > — 1-fold primitive function of §(y), ie. a function G(y) 
with 


dyn! Gy) — o(y). 


' , i ; m 
Of course @(y) is determined only up to an additive polynomial of degree 27 2 


in y. 
Now the above expectation value becomes 
—_——— Cy qi" 1 


o ay G(y)w(y) dy 


oly) = 


3m—1 C,—90 qe 
5 = ~ G(y)w(y) dy. 
C 


A=1 orx+14+0 dy*™ 1 


Since all a w(y),k =0,1,---, 3 — 2, are continuous from C),; + 0 to C, — 0 
m 


r all = ‘ 
for all 19 


— 1, we can evaluate each integral of the above sum by 


m . — ; y , ‘ 
— — 1] successive partial integrations. Thus the following expression obtains: 


ee cy Sow awl 


y=C,+1+0 


q?"- 1 


of 6) Fa 


w(y) dy. 


Considering the definition of @(y) as an 5 
d* 


— 1-fold primitive function, the 


G(y), k’ = 0,1, ---, 3 — 2, are everywhere continuous. This corresponds 
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m 


“sa * 2. Hence the first line can be rewritten 

















y=C,+0 


! }m—k—2 k 
-2 = -v'{ ow} {Low} 


y=C),—0 


. j d* , 
(F or Cy = C; or Cym the dy w(y) at y = C; + Vor Cyn — 0, respectively, must 


obviously be taken to be zero.) Owing to the results of section 5 all terms 





m ‘ . m ; 
_ 3 vanish, and the term with k = _7 2 gives 


_ 


bm oo 7 ™ ‘ 
-¥ (= 6D" (% - 1) tg Fee 
II (Ci. — Cy) II (C, — C,) 
a k=\+1 
ae > m 1 
=X (9 (2-1) oo 
IT (c. — ey) IL ( - cy) 


=\+1 


with k = 0,1, ---, 


G(C)). 


4m—1 
‘ ‘ . t , 
The second line vanishes, since jt w(y) is zero everywhere, as observed above. 
¢ y m— 
Finally 
a = r-1 fm 1 
BW) = (14 ("= 1) ty . @(CY). 
Ili.-c) I] ( - cy 
k=l k=\+1 






im 


I] (2 — C;) 


k 


B(z) 


we have 


[a B(2)| = [Tue (Cr — Ci) 
dz z=Cy k=l 
A-1 4m 
= (—1)*" J] ( — G) II c — Cy. 
k=l k=d+1 
Therefore the above formula can also be written 
ies 4m 
_ C 
iy) = (7 - 1) 13, pO _. 
2 FASE ee! 
\dz Loum, 


Observe that the right-hand side of the above formula (which can also be 
easily expressed in terms of determinants) is a well-known approximate ex- 
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4m—1 


pression for dy G(y), as a (repeated) difference quotient of the values G(C,), 


“1. It is therefore very satisfactory that this expression gives the 


mean of 


km—1 


S(y) = 2 


dy G(y). 


Appendix. We return to the normal distribution of 2, --- , 2, as described 
in section 1, and to the quantities s’, 6, » given there. We denote means with 
respect to that distribution by (--- ). 


It was observed by B. I. Hart and mentioned by J. D. Williams’ by com- 
2 
paring the known expressions for their moments, that every moment of 7 = ; 


is the quotient of the corresponding moments of & and of s*. That is 
(p 0, 1, 2, aD 


This indicates some kind of independence relation involving 5 and s*. The 
considerations which follow are intended to clarify this situation. 
The above relation may be written 


2 2 
gs? 4? “a s? n”, 


or, more generally, 
stn? = 87’. 
We shall prove this by showing that s and 7 are statistically independent. 
We can, as in section 2, make the mean ~ = 0, i.e. obtain the 7, +--+, 2, 
distribution law 


n 
—S 22/202 
Ce we” day +++ dda. 


And then, again as in section 2, perform a linear orthogonal transformation, 

. . , / . ° . . . 
carrying 2, --- , 2, into, say 2, --- , Z, which leaves the distribution law in 
its original form 


and makes 
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° , : 2 , ’ ° ° 
Since x, does not occur in s’, » we must use only the 21, --+ , Z,-1 distribu- 
tion law 


n—1 
n--1 — > 212/262 ! , 
Cc € w= * da, +++ dtn-1. 


+ . . . / , 
Now we introduce polar coordinates with respect to 71, ---,2%,-1. These 
consist of a radius r with 


“u? 


and n — 2 angular variables ¢,, --- ,¢n-2, which can be chosen in various 
ways, and which we need not describe more closely. At any rate 


’ ’ i 
dx, +++ dt,1 = 1" “ drw(gi, +++ , Gn-2) dor +++ dens 


where we need not determine the weight function w(¢:, --- , ¢n-2). Conse- 
quently the distribution law is 
Pre drw(gi, +++ , Gn-2) dgi +++ Agnes. 


Thus the coordinate r and the coordinates g; , --- , gn-2 are independent of each 
other. 
Next 


° . / , . . ° ° 

and 7 is a homogeneous function of 2; , --- , Z,-1 of degree zero, i.e. it is inde- 
pendent of r. So sis a function of r alone, and 7 is a function of g1, --- , gn-2 
alone. Consequently s and 7 likewise are independent. 


Added in proof: 

After this manuscript was completed, Dr. T. Koopmans informed the author 
of several results of his own, which he obtained in connection with other statistical 
investigations. They have many points of contact with this investigation, and 
will appear in the near future in the Annals of Mathematical Statistics. The 
author wishes to express his thanks to Dr. T. Koopmans for his communications. 





SOME EXAMPLES OF ASYMPTOTICALLY MOST POWERFUL TESTS 


By ABRAHAM WALD! 
Columbia University 


1. Introduction. In a previous paper’ the author gave the definition of an 
asymptotically most powerful test and has shown that the commonly used tests, 
based on the maximum likelihood estimate, are asymptotically most powerful. 

In this paper some further examples of asymptotically most powerful tests 
will be given. Let us first restate the definition of an asymptotically most 
powerful test. Let f(z, 6) be the probability density of a variate x involving an 
unknown parameter 6. For testing the hypothesis 6 = 6) by means of n inde- 
pendent observations 2; , --- , 2, on x we have to choose a region of rejection 
W,, in the n-dimensional sample space. Denote by P(W,, | 6) the probability 
that the sample point FE = (2,, --- ,2,) will fall in W, under the assumption 
that @ is the true value of the parameter. For any region U’, of the n-dimen- 
sional sample space denote by g(U,) the greatest lower bound of P(U, | 4). 
For any pair of regions U, and T,, denote by L(U, , T,) the least upper bound of 


PU, | )) — P(T,, | @). 


In all that follows we shall denote a region of the n-dimensional sample space 
by a capital letter with the subscript n. 

Definition 1: A sequence {W,} (n = 1, 2, --- , ad inf.) of regions is said to be 
an asymptotically most powerful test of the hypothesis 6 = 4 on the level of 
significance a if P(W,| 6) = a@ and if for any sequence {Z,} of regions for 
which P(Z,, | 6) = a the inequality 

lim sup L(Z,, Wn) < 0 


n—?>o 


holds. 

Definition 2: A sequence {W,} (n = 1, 2, --- , ad inf.) of regions is said to 
be an asymptotically most powerful unbiased test of the hypothesis 6 = 4 on 
the level of significance a if P(W,, | 6) = lim g(W,,) = a, and if for any sequence 


{Zn} of regions for which P(Z, | 6) = lim g(Z,) = a, the inequality 


lim sup L(Z,, W,) < 0 


no 
holds. 
1 Research under a grant-in-aid of the Carnegie Corporation of New York. 


2 ““Asymptotically most powerful tests of statistical hypotheses,’’ Annals of Math. Stat. 
Vol. 12 (1941). ' 
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Consider the expression 









1 <a 
1 n(@) = —= — log f(r, 4). 
(1) (0) = De 5 low fre, 6) 
Let W;, be the region defined by the inequality y,(0) > c, , W., defined by the 
inequality yn(%) < c, , and W,, defined by the inequality | y,(0) | > c, , where 
the constants c), , c, and c, are chosen such that 


P(W%, | 0) = P(W% | 60) = P(Wa| %) = a. 





It will be shown in this paper that under certain restrictions on the probability 
density f(x, 0) the sequence {W/,} is an asymptotically most powerful test of the 
hypothesis @ = 4 if @ takes only values > 6. Similarly {W+,} is an asymptot- 
ically most powerful test if @ takes only values < 6. Finally {W,} is an 
asymptotically most powerful unbiased test if 6 can take any real value. 
Another example of an asymptotically most powerful unbiased test of the 
hypothesis 6 = 4, as it will be shown, is the critical region of type A in the 
Neyman-Pearson theory of testing hypotheses. This fact gives a strong justifi- 
cation for the use of the critical region of type A. 






















2. Assumptions on the density function. Let w be a subset of the real axis. 
Denote by 6* a real variable which takes only values in w and let 6 be a variable 
which can take any real value. For any function ¥(x) we denote by Ea/(x) the 
expected value of ¥(x) under the assumption that 6 is the true value of the 
parameter, 1.e. 


Byy(e) = [ va)sle,0) de 


For any 2, for any positive 6 and for any real value 6, denote by ¢;(z, 6; , 6) the 
greatest lower bound, and by ¢e(z, 6: , 6) the least upper bound of x log f(z, 6) 


in the interval 0; — 6 < 6 < 6, + 6. In all that follows the symbol 67 , for 
any integer i, will denote a value of 6*, i.e., 6; is a point of w. 

We say that a value 6 lies in the eneighborhood of w if there exists a value 6* 
such that |@ — 6*| < «. 

Throughout the paper the following assumptions on f(z, 6) will be made: 

AssuMPTION 1: For any pair of sequences {6,} and {02} (n = 1,2, --- , ad inf.) 
for which 


. 0 * 
lim Eo, a5 log f(z, @,) = 0 





lim (6, — 6-) = 0. 


n=o 
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Furthermore there exists a positive «€ such that B,| 2 log f(z, a) | is a bounded 
function of 6 and 4,, Ey 5 108 f(x, 61) ts a continuous function of 6 and 6, and 


Es, E log f(z, ay] = d(61) has a positive lower bound, where 6, can take any value 


in the «neighborhood of w. 

AssUMPTION 2: There exists a positive ko such that Ep,gi(z, 61, 5) and 
Eo.g2(x, 01, 6) are uniformly continuous functions in the domain D defined as 
follows: the variables 6, and 0, may take any value in the ko-neighborhood of w and $ 
may take any value for which |5| < ko. Furthermore it is assumed that 


E,,[¢i(z, A; ’ 5), (2 — 1, 2) 


are bounded functions of 6, , 62. and 6 in D. 
ASSUMPTION 3: There exists a positive ko such that 


+00 a +00 a 
| 5S, 6) dx = [; "S(t, 0) de = 0 
for all @ in the ko-neighborhood of w. 


Assumption 3 means simply that we may differentiate with respect to @ under 
the integral sign. In fact, 


+00 
[ f(x, 6) dx = 1, 
identically in @. Hence 
2 [10 tan” [se 6)dx =0 
ai. *”  — 


Differentiating under the integral sign we obtain the relations in Assumption 3. 
AssuMPTION 4: There exists a positive ko and a positive n such that 


a 2+n 
Es 2 og f(z, 0) | 
is a bounded function of 0 in the ko-neighborhood of w. 


3. Some propositions. Proposition 1: To any positive B there exists a posi- 
tive y such that 


lim 0 PS | yn(O*)| > | ah =i 


uniformly in 6* and for all @ for which | 6 — 6* | . B. 


Proor: From Assumption 1 it follows that Eo =, ©, log f(x, 6*) | has a positive 
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lower bound in the domain | @ — 6*| > 8. Since according to Assumption 1 
B,| 3 log f(z, 0 | is a bounded function of 6 and 6*, Proposition 1 easily follows. 
PROPOSITION 2: There exists a positive € such that 


lim P[yn(6) < t | 6] = N(t! 6) 
uniformly in t and for all @ in the «neighborhood of w where 


3 


@) d(0) = — By & log ste, 0) = Bs] 9 tog sc, 0 | 


-4y2/d(6 
(3) it me [. ePNA® gy 
/ 2nd (6) 
Proposition 2 follows easily from Assumptions 3 and 4 and the general limit 
theorems. 
ProposiTION 3: There exists a positive e€ such that for any bounded sequence {un} 


( t ‘ } 
lim ) P EO <t|o+ +. | — [eet aN(o|o)p = 0 


n=s n / 


\ 


uniformly in t and for all 6 in the neighborhood of w. 
Proor: We have 


(4) no ts) = yn(0) + 2x “log f(ea, 0%) 


where @;, lies in the interval |e, 6+ #.| . From Assumption 2 and the above 
n 


equation we easily obtain 


oe (P| v.(0 + ++) ee | 


—P EO — prd(6) < t| e+ #-|\ =o 


/ 


uniformly in ¢ and for all 6 in the eneighborhood of w. From Proposition 2 
and (5) we get 


lim f dN(v| 0) — P| ya) <t+ und(6) | 6+ — +} = = 0 


n=20 


li [ me aN | Mn ad 
im 4 | dN(v |) —P nah < 818+ <> = 0 
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uniformly in ¢ and for all @ in the eneighbourhood of w. 
sition 3. 



















This proves Propo- 


ProposiTIon 4: There exists a positive € such that for any positive y and for 
any sequence {un} for which lim | ura| = © 
n=co 


ba el ) u 
lim Ps | yn(6*) | >y|e* + — > = 1 
n=o \ VJ n. 
uniformly in 6*. 


Proor: If there exists a positive 6 such that | zs, | > 6 for almost all n, 


n | 
Proposition 4 follows from Proposition 1. Hence we have to consider only the 
— a: 
case lim —— = 0. Since 


n=o0o /n 


Bee¢(univa) Yn G + +) = 0 
+ a/n ? 
we get from (4) 


a” ' 
De 592 108 (tas On) 


(7) Eo9+4-(unt/n)lYn(O*)] + pn Boe (unl /n) amie gaara = 0. 


‘ i Ln i 
Since lim a = 0, we have on account of Assumption 2 
n 


a ’ 
» ae? log I(x , 6n) a 





lim Eos ual/n) 


n= nr 


9 
« 


7, * -_ * 
= -Be| 3 log f(x, 6 | = —d(*) 


uniformly in 6*. According to Assumption 1 d(@*) has a positive lower bound; 


hence on account of lim | u,| = © we obtain from (7) 
(8) lim | Loe (un/y/n) Yn(O*) | = 2 


n=o 





, , ‘ . . . 0 
uniformly in 6*. The variance’of y,(6*) is equal to the variance of 30 log f(x, 6*). 


On account of Assumption 1 the variance of M log f(x, 6*) (under the assumption 


that 6* + Va is the true value of the parameter) is a bounded function. Hence 
n 


Proposition 4 is proved on account of (8). 
Proposition 5: Let {W,(6*)} be a sequence of regions of size a, 1.€. 
P[W,,(6*) | 6*] = a, and let V,(6*, y) be the region defined by the inequality 
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yn(O*) <y. Let U,(6*, y) be the intersection of V,(6*, y) and W,,(6*) and denote 


P[U.(6*, y) | 6*] by F.(y | 6*). Denote furthermore P| wo" | * + J by 


G(e*, w, n). If {9n} and {un} are two sequences such that lim d(6>) = d; 


n =00 


lim F,,(y | 6°) = F(y) and lim pn, = p then 
a _ 2 
lim G@%, n,n) = [ot ™4ar@). 
Proor: Let lim yu, = yw and consider the Taylor expansion 


* Hn _ = * Hn ¢ * 

7 2X log f(z 6* + Se) X log f(a, 6*) + a dX a5 log f(a, 6*) 
1p, a ' 
+ 3 . ap dX log f(a, On) 


aia i : Mn . : 
where 0, lies in the interval Gg 6* + =} From this we easily get on account 
n 


of Assumption 2 and the fact that {u,} is bounded 


Mn 
n i( te, * + - -) 
(10) log [I - vin 


————<—<$<—<—$< _———— = “ . 2.8 * # » 
=1 Ie; 6*) Mn yn(O ) aMn d(@ ) a. (0 : n) 


where for arbitrary positive 7 


(11) lim P { e(0*, n)| << | O* + S.\ =] 


uniformly in @*. Denote by R,(6*) the region defined by 
(12) | €(6*,n)| <> 0. 


On account of (11) we have 
. * m" Hn i 
(13) lim Pp Ex )\o*+ S.| 1, 


uniformly in 6*. Denote the intersection of R,(6*) and W,(6*) by Q,(6*), and 
the intersection of R,(6*) and U,(6*, y) by 7T,(6*, y). Furthermore denote 
P[T,.(6*, y) | 0*] by F.(y | 6*). Then we have 


t 
e” | etnu—huy, a(6*) dF ,(y | 6*) < P EG Net Wa 
(14) r 


t 
< e” I etny—teh d(6*) dF ,.(y | 6*) 
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for all values of ¢ - 6*. Furthermore we obviously have 


vl wt 


aad 7 


lim < Gos Un, n) —-P 2. (0*) | @* + 


n=o 
uniformly in 6*, and 


(16) lim [F,,(t| 6*) — F,(t 


n=cO 


|6*)] = 0 














uniformly in 6* and ¢. 
(14) and (1 


Since 7 may be chosen arbitrarily small, it follows from 
5) that to any e > 0, 7 may be chosen such that 


(17) | ont OD GB (| 9*)| < £ 


° Vin® 
lim sup | G(6,,, un, n) — 


n=00 


lA 


for anv s {9*) 
or any sequence {6, | 


To each ¢ let L, be a positive number such that L,. depends only on ¢ and 


7 2 d(@*) y = 1.2 a(A* : 
(18) | etme 4) AN (t | O*) + i ent uh a) INTE | O*) $5 
— oO L, 


for all n and for all values of 6*. Since d(6*) has a positive lower and a finite 


upper bound, it is easy to verify that such a L, exists. From (18) and Proposi- 
tion 3 it follows 


lim sup {p EEGs < —L,|\65 + | 
n> \ V/n 


+ P| elo) > L, | On + 








(19) 









“Al <5 


for any arbitrary sequence 9°}. Since the difference U,,(6*, t:) — U,(6*, t) is 
a subset of the difference V,,(6*, f2) — V,(0*, t:) and since T,,,(0*, t2) — T,(6*, th) 


is a subset of U,(0*, f&) — U,(0*, 4) for tg > t,, we get from (18) and (19) 








li ( > y * | * Mn Bn 
1m sup e l n(O, ; —L,) | 6, + wn + P W (0 ) | 6; + ale 
noo n 











(20) 
_ Pie (0. ’ L,) | 6. + #.]} < 2 
and 
im sup {P| T.(6%, — | + Pla. (07) |6, + | 
(21) nro n a 


— P| ra ’ L.) | 0%, . | 
Vn 


* 
for any sequence {6,}. 


—L, 
e " lim sup qT 


On account of (14) we get from (21) 


e nt —hu2d(6%) dF, (t | 6 *) + [ ebnt— —hu2d(6*) dF, (t | mh <6. 






(22) 







rr) 


= — 
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From (17) and (22) we obtain 
Be n 
(23) lim sup G(O% , un, n) — | ent ont) IP (t lax)\ < 4*3*) 
n—2 LL. 


eo eee \@,}. Consider now the sequence {6} which satisfies the 
conditions of Proposition 5. Since F’,(t| 6) converges to F(t) uniformly in t, 
on account of (16) also F,,(t| 62) converges to F(t) uniformly in t. Hence we 
obtain from (23) 


n—->oo 2 


Since « and 7 may be chosen arbitrarily small, Proposition 5 follows from (24). 


} Le n 
(24) lim sup | G(@,, wn, ”) — | eft —w*d IF (t) | < (it*). 
= 


4. Some theorems and corollaries. THEOREM 1. Denote by S,(0@*) the region 
defined by the inequality y,(6*) > A,(@*) where A,(6*) ts chosen such that 
P{S,(0*) | 0*] = a. For any region W,,(0*) denote by L,[W,(6*)] the least upper 
bound of P{W,,(6*) | 6] — P[S,(0*) |.6] with respect to 6* and 0, where 0 is restricted 
to values > 6*. Then for any sequence {W,,(6*)} for which P[W,,(6*) | 6*] = a, 

lim sup L,[W,(6*)] < 0. 

Proor: Assume that Theorem 1 is not true. Then there exists a sequerice 

of integers {n’}, a sequence 9") anda sequence {6n°} (0.7, = 6,) such that 


(95) “lim { P[Wn-(6*+) | On] — P[Sn(0*-) | Onl} = 6 > 0. 


n=eo 


*,¢ . K . 
On account of Proposition 2 and Assumption 2 the sequence {An/(@,-)} is 
bounded. Then it follows easily from (25) and Proposition 4 (taking in account 


that EB, < log f(x, 6*) > O for 6 > 6* 


(26) (nr — Onn! = unr > 0 
must be bounded. Denote by {n’’} a subsequence of {n’} such that 
(27) lim d(6,.) = d 
(28) lim pny = pw, and 
(29) lim F(t | 02+) = F(t) 
uniformly in ¢ where 
F(t | *) = P[U,(6, t) | 6*] 


and U,,(6*, t) is the intersection of W,,(6*) and the region y,(@*) < t. The exist- 
ence of a subsequence {n’’} such that (29) holds follows from the fact that 


(30) F i (te | 6*) —_ Filth | 6*) < ®,,(te | 6*) = ®,(t; | 6*) for ty > ty, 
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and 
. * 1 : —hy2/d ry P 
(31) lim &4(t | 62--) = —— | e**" dv = NO, t 
n =00 V/ 2d ——_ a 
where ®,,(¢ | @*) denotes the probability P[y,(@*) < t| 6*]. Furthermore it can 
easily be shown that 
+00 
(32) [ a@=a 
On account of Proposition 5 we get from (25), (27), (28), (29), (30) and (31) 
00 ) 
(33) [ eft ed F(t) — | eft "4 IN(t) = 6, ; 
i— OO A 
where A denotes a value such that 
| dN(t) = a. é' 
A 
It has been shown in a previous paper’ that (33) leads to a contradiction. Hence | 


Theorem | is proved. 

THEOREM 2: Denote by S,(0*) the region defined by the inequality y,(6*) < 
A,(6*) where A,(0*) is chosen such that P{S,(0*) | 6*] = a. For any region 
W,,(0*) denote by L,[W,,(0*)] the least upper bound of 

P(W,,(6*) | 6] a P[S,,(6*) | 4] ; 


with respect to 6* and 0, where 6 1s restricted to values < 6*. Then for any sequence 
{W,,(0*)} for which P[W,,(0*) | 6*) = a, 


lim sup L,{[W,(6*)| < 0. 


n->oo 


The proof is omitted, since it is analogous to that of Theorem 1. 
THEOREM 3: Let {W,(0*)} be for each @* a sequence of regions for which 
P[W,,(6*) | 6*] = aand lim g[W,,(6*)] = a uniformly in 6*. Denote by L,[W,(6*)] 


n=o 


the least upper bound of 
P[W,.(6*) | 6] — Pl| yn(6*) | > An(6*) | 6] 
with respect to @ and 6*, where A,(6*) is chosen such that 
P{| yn(O*) | 2 An(6*) | 6*] = a. 
Then 
lim sup L,[W,(6*)] < 0. 


no 


3 See p. 12 of the paper cited in ?. 





an 


ce 


1/\ 


m 


ce 
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Proor: Denote P[y,(@*) < ¢| 6*] by ®,(t| 6*) and denote by F,(t| 6*) the 
probability (under the hypothesis @ = 6*) of the intersection of W,(6*) with 
the region y,(@*) < ¢t. Assume that Theorem 3 is not true. Then there exists 


* 
a subsequence {7’’}, a sequence {6,--} and a sequence {6,--} such that 


lim dz) =d; lim (6,77 — 0% +)4/n” = lim pn’ = ps; 


lim Fy(t| 02+) = F(2) 


uniformly in ¢, and 


to » 
(34) [ ett —wd OF (t) -[ 


where A is a positive number such that 


san . 
| dN(t) = as and N(t) = e | e "ld dy. 
_ 2 V 2rd H-« 
This can be proved in the same way as (33) has been proved. The author has 
shown in a previous paper’ that (34) leads to a contradiction. Hence Theorem 
3 is proved. 
THEOREM +4: Denote by A,(6*) the region of type’ A of size a for testing the hy- 
pothesis 86 = 6*. Denote by B,(6*) the region | y,(6*) | > C,(6*) where C,,(6*) 
is determined such that 


eft -te"4 IN (1) — | ett "4 IN (t) = 6 
A 





P{| yn(6*) | = Cn(8*) | 6*] = a. 


> 


2 2 
Then, under the assumption that Eg 5 log f(z, | is bounded, 


lim { P[A,.(6*) | 6] — P[B,(6*) | é]} = 0 


n=O 


uniformly in @ and @*. 
Proor: The region A,(6*) is given by the inequality® 


a a 
|= 30 log f(ta, @ | 
(35) sa 
+> si log f(ta, 6*) > kn(6*) b > log f(r, 0 | + ki (6*), 


where k;,(6*) and k,,(6*) are chosen such that A,(6*) should be unbiased and of 
size a. The inequality (35) can be written also in the form 


(86) yal") + — DF log flare, 6°) > Ua O*)ynlO") + 100"), 


4See p. 14 of the paper cited in 2. 

5’ Neyman, J. and Pearson, E. S., ‘Contributions to the theory of testing statistical 
hypotheses,’’ Stat. Res. Mem., Vol. 1. 

6 See the paper cited in °. 
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Let {un} be a bounded sequence. From Assumption 2 it follows that for any 
positive e 


” > if | a ttn we ** | gx Ma \ 
(37) PY | 5 Le age lB Sea, O*) + a(o") <€!6 + tha 


uniformly in @*. Since (37) holds for arbitrarily small «, we get easily on ac- 
count of Proposition 3 


( ) 
(38) lim P| An" | @* + ee. | — Pp | (0*) | o* + Al = 0 
n=00 /n /n ) 


uniformly in 6*, where A,,(6*) is defined by 

(39) [yn(O*)P > 1, (6*)yn(6*) + Tn (6*) + d(6*). 

Since A,,(6*) is unbiased and of size a, we have on account of (38) and (39) 
(40) lim U,(6*) = 0 and 

(41) lim 1 (6*) + d(0*) = X(6*) > 0 


uniformly in 6*, where \(6*) is given by the condition 


—}t2/d(8*) 
e€ dt =a. 


] VX(0*) 
(42) | 
V/ 2rd (6*) —VX(0*) 
Inequality (39) is obviously equivalent to the simultaneous inequalities: 
Yn(O*) < c,(6*) and Yr(O*) > c, (6*) 
where c,(6*) and c,,(6*) are the roots of the equation in y,(6*) 


[yn(O*)] = U.(0*)yn(O*) + U%(8*) -+ d(6*). 
Since 
lim c;,(6*) = — /X(6*) and 


uniformly in @*, from Proposition 3 it follows that 


r= 


li ( > *\ | ox - Mn 
im r | 400 |e +"— | 


Va 
—Vx(0") ie 
— [eee ance — | 


ee) +V}(6*) 


(43) 


2 


elint—4u2d(6*) dN(t | o)} = 0 


uniformly in 6*. 
Vn 


Vn 


Now let us consider a sequence {v,} such that lim | », | = © and lim 
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We shall prove that 


y P| A,(6* * Pa a 
(44) | (0*) | 6 + | 1 


2 2 
’ . : a ; 
uniformly in 6*. Since |S log f(z, | is assumed to be bounded, 


3 
(45) Eo++ (n/ vm) 5 log f(x, | 


and 


3 2 
(46) Eos (onl/m) 5 log f(z, | 


are bounded functions of 6* and n. We get by Taylor expansion 


0 . ee Q P = 
Do 9g 1B Sta, 6*) = Le 5, tog f (x0 ied va) 
(47) 
Vn 


a . 
~ Wa z 562 log f(ra, 6*) 


0 as Vn 
where 6, lies in G g* + 2. | Hence 
n 


1 a . 
(48) Eog+4-(n/a/n)LYn(O*) | = —Vn Eoo+4-(vn/4/n) E ae? 7 log S (a ’ an) | . 


From Assumption 2 and lim |», | = « it follows that the absolute value of 
the right hand side of (48) converges to «. Hence 


lim | E6+s-n/-/nYn(O*)] | = ©, 


Since on account of Assumption 1 


a 2 
Boer +(on//r) 3 log f(ta; | 
is a bounded function of n and 6*, also the variance of y,(@*) (under the assump- 


tion that 6 = 6* + »v,/+/n is the true value of the parameter) is a bounded 
function of n and 6*. Hence for any arbitrary large constant C 


(49) lim P| yn(O*) | > C\e* + oe = 1, 


uniformly in 6*. The equation (44) follows easily from (36), (40), (41), (45), 
(46) and (49). 
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| Pn | ; 
Consider a sequence {p,} such that Tr >68>Oforalln. Then it follows 
easily from Proposition 1 that for any arbitrary C 


i Pn 
>} lyn) | > Chor + | = 
(50) lim F [zay = Cig &. | 1 


9 


; a a : - 
uniformly in 6*. Since Bl log f(Xa , 0 | is assumed to be bounded, and 


therefore also E, a log f(x, 6*) is bounded, there exists a finite g such that 


‘ (\1 a Pn | 
(51) lim Py _ dX a2 log f(a, 0*)| < g|o* + afal =] 


/ 


uniformly in 6*. From (36), (40), (41), (50) and (51) it follows 
° i : Pn 
rl r(O*) | @* - —~ |=] 
(52) lim J | 4 ,(0*) | O* -+ vA 


uniformly in @*. Since on account of Propositions 3 and 4, the relations (48), 
(44) and (52) hold if we substitute B,(6*) for A,(0*), Theorem 4 is proved. 
If Assumptions 1-4 are fulfilled for the set w consisting of the single point 
6 = 4, then we get from Theorems 1-4 the following corollaries: 
Coroutiary 1: Let W;, be the region defined by the inequality yn(@%) > i 
W', defined by the inequality y,(0) <c,, and W,, defined by the inequality 
| yn(O0) | > cn, where the constants c, , C, and ¢, are chosen such that 


P(W%, | 6) = P(W: | 6) = P(W2| %) = a. 


Then {W),} is an asymptotically most powerful test of the hypothesis 6 = 0 if 8 
takes only values > 6). Similarly {W',} is an asymptotically most powerful test 
if 6 takes only values < 0. Finally {W,} ts an asymptotically most powerful 
unbiased test if 6 can take any real value. 

Coro.Luary 2: The sequence {A,(0)} ts an asymptotically most powerful un- 
biased test of the hypothesis 0 = 0), where An(0) denotes the critical region of 
type A for testing 0 = 6. 





ON THE DISTRIBUTION OF THE QUOTIENT OF TWO CHANCE 
VARIABLES 


By J. H. Curtiss 
Cornell University 


1. Introduction. Although the quotient of two chance variables appears fre- 
quently in mathematical statistics, the methods used in the literature to derive 
the distributions of quotients have usually been special ones devised for the 
particular variables under consideration, and in no way indicative of the general 
result. It is the purpose of this paper to study the distribution of the quotient 
of two variables for itself alone, with attention first to the question of existence, 
and then to the accurate derivation of a number of general formulas for the 
frequency function and d.f.’ The principal formulas which we shall derive may 
be described briefly as follows (the numerals refer to the equation numbers in 
the text): 

(3.1). The frequency function of the quotient of two variables which have an 
absolutely continuous joint probability function. 

(4.11), (4.12). The d.f. of the quotient of a pair of arbitrary independent 
variables, expressed in terms of the d.f.’s of these variables. 

(5.2). The d.f. of the quotient of a pair of arbitrary independent variables, 
expressed in terms of the c.f.’s’ of these variables. : 

(6.4). The limiting form of the d.f. of a quotient of two sums of arbitrary 
identical independent variables. 

(7.1). A formula analogous to (3.1) for the product of two chance variables. 

(7.2). A formula analogous to (4.11) for the product of two chance variables. 


2. The existence of the quotient distribution. The function Z = X/Y is a 
continuous function of X and Y, finite and uniquely defined for all points 
(X, Y) such that Y ¥ 0. Therefore if P{Y = 0} = 0, the pr.f.* P(S) of the 
joint distribution of X and Y determines a probability distribution for Z (see 
(1, pp. 12-13]). To avoid irrelevant difficulties, we shall assume in the sequel 
that P{Y = 0} = O unless definite statement is made to the contrary. This 
assumption involves no real restriction on our work, for in situations in which, 
a priori, the assumption is not fulfilled, we can always replace the distribution 


1L.e., distribution function. The underlying axioms, terminology, and abbreviations 
in this paper are uniform with those of Cramér’s book [1]. For the definition of d.f., see 
(1, p. 11]. 

? I.e., characteristic functions. See [1, p. 23]. 

3 T.e., probability function; [1, pv. 9]. 
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of Y by the conditional distribution of Y relative to the hypothesis that Y = 0, 
In such cases, then, the distribution of Z which we are about to study is to be 
interpreted as a conditional distribution relative to this hypothesis. 

We shall suppose that the space of X is the z-axis, that of Y, the y-axis, and 
that of Z, the z-axis. It is quite readily seen that the set of points in the (z, y) 
plane which corresponds to the set Z S z consists of 

(i) the infinite region* in the upper half-plane which is bounded by the nega- 
tive x axis and by the line x = zy; 

(ii) the infinite region in the lower half-plane bounded by the positive z-axis 
and the line x = zy; 

(iii) the line x = zy except for the origin. 
Denoting this set by S., we have 


H() = [ aP(s) = P(S», 


where H(z) is the d.f. of Z. The present paper, from the viewpoint of analysis, 
is simply a study of the Lebesgue-Stieltjes integral appearing in this equation. 


3. The continuous case. Suppose first that P(S) is absolutely continuous. 
This means that the joint distribution of X and Y has a frequency function 
g(x, y), which is defined almost everywhere, is non-negative, and has the prop- 


erty that P(S) = [ ee, y) dx dy. In general, this integral must be taken in 


the Lebesgue sense, but of course if the discontinuities of ¢ form a set of two- 
dimensional measure zero, and if the Jordan content of any bounded portion of 
the boundary of S is zero, then this integral is just an ordinary improper double 
Riemann integral.’ In particular, these conditions are fulfilled if ¢ is continuous 
everywhere and if S = S.. 

The transformation x = wv, y = v, gives a continuous one-to-one map of 8, 
onto a set S, of the (u, v) plane which consists of the closed half-plane lying to 
the left of the line u = z, but with the w-axis deleted. The Jacobian of the 
transformation has the absolute value |v|. By the theorem for change of 
variables in Lebesgue integrals [4, pp. 653-655], we have 


H(z) = | g(x, y) dx dy = | 'v | p(uv, v) du do. 


S 


By Fubini’s Theorem [6, pp. 203-208], the last integral can be expressed as a 
repeated integral. Integrating first with respect to v, we obtain this result 
THEOREM 3.1: If the joint variable (X, Y) has the frequency function ¢(2, y), 


then | 
H(z) = [ | | |v | our, v) av du, 


4 T.e., open connected set. 
5 See [4, pp. 476-478: p. 575]. 
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and consequently H(z) is an absolutely continuous function of z. The frequency 
function of the distribution of Z exists almost everywhere, and is given by the for- 
mula 


(3.1) h(z) = F(z) = [- |v | o(zv, v) dv. 


We remark that if X and Y are independent, so that g(x, y) = f(x)-g(y), 
where f and g are respectively the frequency functions of X and Y, then (3.1) 
may be written in the form 


(3.2) h(z) = [ ; | v | f(ev)g(v) do. 


This case was considered recently by Huntington [5], with the additional restric- 
tions that g(y) = 0, y < 0, and that f(x) and g(y) be continuous. 

All the familiar special quotient distributions of applied mathematical sta- 
tistics, such as Student’s ¢ and Fisher’s z, may conveniently and rigorously be 
derived by means of (3.1) and (3.2); in each case the required result follows 
immediately after an obvious change of variables in the integrand. We pause 
here only to point out explicitly the result obtained when X and Y have a normal 
joint distribution with variances ox : oy; , and correlation coefficient p. If the 
means E(X) and E(Y) are not equal to zero, it is apparently impossible to 
evaluate (3.1) in closed form; this case has been studied in some detail by 
Geary [3] and by Fieller [2]. But if E(X) = E(Y) = 0, then 


yo atevi=e, 
oy (: —p er) + ox(1 — p’) 
‘ 


which is the frequency function of a Cauchy distribution with mode at the 
point z = pox/cy , the value of the regression coefficient of X on Y. If X and Y 
are independent, then p = 0, and the frequency function becomes 


22 2° 
T Oy2 t+ox 


4, The quotient of two arbitrary independent variables. We shall hence- 
forth drop the restriction that P(S) be absolutely continuous, but shall suppose 
instead that X and Y are independent chance variables with one-dimensional — 
distributions of the most general type, except that the distribution of Y will be 
subject to the restriction that P{Y = 0} = 0. 

We denote the d.f. of X by F(x), that of Y by G(y), and, as usual, that of Z 
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by H(z). It is to be noticed that the condition P{Y = 0} = O implies that 
G(y) is continuous at the point y = 0. Let 


s@) = [ e aF() 








(4.1) cw =f ” ef aGly) 





g(t) = [ oY aGy). 





Clearly 


(4.2) 









H(z) = Pix —zY ©0;Y>0}4+ P{X —zY¥Y 20;Y <0}. 
We introduce the functions 


T\(u) = P{X —zY¥ <u; Y>0} =[1 —GQO)]-P{X —zY su! Y>0}; 


y(t) = [ e™ dTy(u), 


T.(u) = P{zY —X Su; ¥Y <0} = GO)-P{zY —X sSu| Y <9}, 





(4.3) 





ni) = [ ~ é ara(u), 





T(u) = Ti(u) + Tou) 





+ 00 


vO =f ear) = nO + nO. 





By (4.2) and (4.3), 
(4.4) H(z) = T(0). 














We shall now evaluate I',(w) and [:(u) in terms of F(x) and G(y), and also 
yi(t) and yo(t) in terms of f(t), g(t), and g (2). 

Let us assume for a moment that P{Y > 0} # 0; that is, that G(O) < 1. 
The conditional distribution of Y relative to the hypothesis that Y > 0 then 
has the d.f. 














(4.5) Gi(y) = ) 1 — G(O) 

(0, y <0. 
The d.f. of —zY relative to this hypothesis is Gi(—y/z) if < < 0, and 
1 — G,[(-—y/z) — Ol if z > 0. 























6 By P(A] b) is meant the conditional probability of the event A relative to the hy- 
pothesis b. 
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It is well known that the corresponding d.f. of the sum X + (—2zY) is given 
by a convolution of the d.f.’s of X and (—zY).’ In the present case, this result 
takes the form 


| F(u — v) ags( 47 < 0, 


ep 


(4.6) P{X —zY¥ <u|Y>0} = | 


Lf, Fu=2) afi -«(-2-0)], z>0. 


Referring to the definition of these Lebesgue-Stieltjes integrals [4, pp. 662-663], 
we see that the change of variables w = —v/z yields the equations 


J F(u + zw) dG,(w), z < 0, 
0 


(4.7) P{X —zY su|Y>0} = i: 
| F(u + zw) dG,(w — 0), z>0. 
0 


Now the definition of the variation of Gi(y) [4, pp. 341-342] used in forming 
these Lebesgue-Stieltjes integrals makes no distinction between the variation of 
G,(y) and that of Gi(y — 0) over any bounded set contained in an interval of 
integration a < y < ~, provided that G,(y) is continuous at a in the two-sided 
sense. Since G(y) is continuous at y = 0 in this sense, it is possible to replace 
G,(w — 0) by Gi(w) in the second of the two integrals in (4.7). 

Equation (4.7) is clearly true for z = 0 as well as for all other values of z. 
Referring to (4.5) and (4.3), we see that 


ria) I F(u + ew) dG(w), all. 
0 
The c.f. of the convolution (4.6) is the product of the c.f.’s of X and of the 
conditional distribution of —zY [1, p. 36]. This product is i: e *” dGy(y). 
0 
Thus by (4.5), (4.3), and (4.1), 


re) 
0 


as) nd = 1 - Go| s0-[ ae | = s09"(-w0. 


We have established (4.7) and (4.8) under the condition that P{Y > 0} # 0. 
However, it is obvious that they are trivially true if P{}Y > 0} = 0. 

We turn now to [.(u). Supposing that P{Y < 0} #¥ O, the conditional 
distribution of Y relative to the hypothesis that Y < 0, has the df. 


Gly) 
Go(y) = 0)’ 


7See [1, pp. 35-36]; also [7]. 
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The conditional distribution of zY has the df. Go(y/z) for z > 0, and 
1 — G.[(y/z) — 0] forz < 0. The df. of —X is 1 — F(—x — 0). Thus 


P{2Y —-X <SulY <0} = 


(fa -A-w-» -oaf1-a(2-o)], <0, 
[- a-rI-w-9-oyac(?), 2>0, 


0 
=l1-— F(zw — u — 0) dG,(w). 


Evidently the first and last members of this equation are equal for z = 0 as well 
as for all other values of z. From (4.3) we obtain 


0 
T.(u) = G(O) — | F(zw — u — 0) dG(w), all z. 


Also, as before, 
y(t) = f(—bg (et). 


Obviously, the last two equations are still true if P{Y < 0} = 0. 
To summarize, we have shown that 


© 0 
(4.9) T(u) = GO) + I F(u + zw) dG(w) -|[ F(zw —u — 0) dG(w), all; 


(4.10) v(t) = fg" (—2t) + f(—Og (et). 
Referring now to (4.4) and letting u = O in (4.9), we are able to state the 
following theorem: 

THreoreM 4.1: If X and Y are independent chance variables with respective 
d.f.’s F(x) and Gy), the df. of the quotient X/Y is given by the formula 








(4.11) H(z) = G(O) + [ F (zw) dG(w) — [ F(zw — 0) dG(w) 


for all values of z. 

We shall not attempt to make a careful study of the above formula, such as 
the studies which certain writers have made of convolutions. However, it does 
seem desirable to place on record here certain remarks concerning it of a more 
or less superficial character. For convenience in later reference, we state these 
remarks in the form of four lemmas. 

Lemma 4.1: Let M, be the set of all values of z such that if z ¢ M,, the set of 
discontinuity points of F(zw) on the w-axis has a point in common with the 


point spectrum of G(w). Then if z ¢C(M)),* the integrals I F(zw + 0) dG(w), 
0 








§ By C(M,) we mean the complement of M, with respect to the z-axis. 
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0 
[ F(zw + 0) dG(w), are Riemann-Stieltjes integrals and consequently the inte- 


grands can be replaced by F (zw) without altering the values of the integrals. 

The lemma follows immediately from the definitions of Riemann-Stieltjes and 
Lebesgue-Stieltjes integrals. 

Lemma 4.2: The set M, is denumerable. 

The proof can easily be supplied by the reader. 

Lemma 4.3: Let Mz be the set of all values of z such that if z « Mz , T(u) is discon- 
tinuous atu = 0. Then M,C M,. 

To prove this statement, we first observe that I'(w) is a genuine d.f. [1, p. 11]. 
For obviously T(— «) = 0, '(+«) = 1, and since I';\(u) and [.(u) are both 
products of d.f.’s into constants, these two functions, and therefore ['(u), must 
be continuous from the right. It is this last property of I'(w) which is needed 
for our present purposes; in particular, we have the relation lim, IF(u) = 
r(0) = H(z). On the other hand, by the general convergence theorem for 
Lebesgue-Stieltjes integrals [4, pp. 663-664], we have 


lim,—4—o '(u) = G(O) + [ F(zw — 0) dG(w) — [ F(zw) dG(w). 


If z be chosen so that this integral and the ones in (4.11) are all Riemann- 
Stieltjes integrals, the expression (zw — 0), wherever it appears, may be replaced 
by zw without changing the values of the integrals. Thus for such a value of z, 
r(+0) = Ir(—0). According to Lemma 4.1, we can be sure that at least if 
ze C(M,), the integrals here will be Riemann-Stieltjes integrals, so our proposi- 
tion is proved. 

Since H(z; + 0) is equal to '(+0) with z = z,, and H(z, — 0) is equal to 
l'(—0) with z = z,, we have the following result: 

Lemma 4.4: The set Me is the set of discontinuity points of H(z). 

By using the alternate form of the convolutions used to derive (4.9), we obtain 
a representation of I'(w) somewhat more complicated than that appearing in 
(4.9). The corresponding formula for H(z) is as follows: 


Cor — F(—0)] — GO)F() + [ a(2) aro) 
wo v : ; 
| - | G(2 - 0) arw - 0), z <0; 
(4.12) H(z) = _ — G(0)] + GO)[1 — F(—0)], 2 = 0; 


1+ GOLA — F(—0)] — Gore) + [ c(’) dF(v — 0) 


| 
| 
- [ @(2-0)aro, 2>0. 
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5. Representation of H(z) by characteristic functions.. A simple algebraic 
formula connecting the c.f. of Z with those of X and Y is not available. How- 
ever, there exists an interesting representation of H(z) in terms of the functions 
f(t), g(t), and g (t). The result may be stated as follows: 

TueoreM 5.1: Let the distributions of the independent variables X and Y have 
finite first absolute moments, and let the integral 


(5.1) ( [ 4 [-) fg" (—2) +f(-D9 CO) | 9, 
— 00 1 t 
be finite for each value of z. Let A(u) be any df. with a finite first absolute moment, 
—|} oo 
and let (/ + | ) - dt be finite, where 6(t) ts the c.f. of A(u). Then 
— 30 1 


™ fOg"(—2t) + f(—Dg &H) — 8 5 


= t 


(5.2) H(z) = A) — pa 


If the integral obtained by formal differentiation under the integral sign with 
respect to z in (5.2) is uniformly convergent in a certain interval J, then the 
frequency function h(z) of the distribution of z exists in that interval and is given 
by the formula 


h(z) = = [  UfOg'(—2) — f(—Dg"ebldt, ze. 


We remark that the condition (5.1) will be satisfied for all values of z if f(t) 
alone satisfies a similar condition, inasmuch as |g‘(f)| S 1, |g (| S 1. 
Important special cases of the theorem arise when A(u) is replaced by F(u) or 
G(u), and when A(w) is so chosen that A(O) = 0. 

Our proof of the theorem will depend on a rather general result due to Cramér 
[1, Theorem 12], which we shall restate here in the special form applicable to the 
problem at hand. 

Lemma 5.1: Let R(u) be a function of bounded variation over the infinite 
interval —2~x7 <u < o, let lim R(u) = lim R(u) = 0, and let r(t) = 


——o u—>+00 


+00 ; +00 . —1 oo ” 
[ e“ dR(u). If (a) |u| dR(u) and (b) ({ 4. / ) “ dt, both are 
— OD — oO =e 1 | | 
finite, then for every value of u, 


R(u) = — + ~~ 


—itu 
; e” @. 
2772 — 0 t 


To prove Theorem 5.1, we observe that since I'(u) is a d.f. (see proof of Lemma 
4.3), the difference T'(w) — A(z) is a function similar to the function R(u) of the 
lemma. If we do let R(w) = T'(u) — A(u), it follows at once that r(t) = y(t) — 
6(t) = f(t)-g'(—2t) + f(—dg (2) — 6(é). If we can verify that this R(u) 


9 The theorem is due to Cramér in the case in which G(0) = 0, and A(u) = G(u). See 
[1, Theorem 16]. 
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satisfies conditions (a) and (b) of the lemma, then we shall have established the 
relation, 


T(u) = A(u) — = [= +100) = 5(t) sew gy 


for all values of u, and letting u = 0 in this equation, we shall obtain (5.2). 

Condition (b) in the lemma is taken care of by (5.1) and the condition on 6(t) 
in Theorem 5.1. Clearly condition (a) will be satisfied if it turns out that I'(w) 
has a finite first absolute moment. Now the existence of finite first absolute 
moments of X and Y will insure the existence of finite first absolute moments 
for the conditional distributions involved in the definitions of T;(u) and T2(u), 
because H| X — zY| S E|X|+|2|E|Y|. It follows quite readily from 
this that the first absolute moment of I'(u) is finite. The proof of the theorem 
is complete. 


6. Distributions of variable form. We consider now the case in which the 
distributions of the numerator and denominator approach limiting forms. 

THEOREM 6.1: Let the independent variables X, and Yg have respective df.’s 
F(x) and G's(y) which depend upon the two parameters a and 8. Let Ha,3(z) be 
the d.f. of the quotient Zag = Xa/Ys. If there exist two chance variables X and Y 
with respective distribution functions F(x) and G(y) such that lim F.(x) = F(z) 


at all points of continuity of F(x), and lim Ge(y) = Gy), at all points of con- 
B00 
tinuity of G(y), then 


(6.1) lim H.s(z) = lim lim H(z) = lim lim H.4(z) = H(z) 
ao ao Bo Bow amo 
B—co 


at all points of continuity of H(z), where H(z) is the df. of the variable X/Y. The 
double limit in (6.1) is uniform in any finite or infinite interval of continuity 
of H(z). 

In the interpretation of the limits involved in this theorem, it is to be under- 
stood that in the hypotheses, a may tend to infinity over any unbounded set 
T, of the a-axis, and 6 may tend to infinity over any unbounded set 7's of the 
8-axis, provided that in (6.1), a and 6 are restricted so that ae T, and B e Ts. 

To prove the theorem, we introduce functions f.(t), g3(t), g3(), Te,s(u), 
Ya,a(t), which are defined by equations (4.1) and (4.3) with fF, G, X, Y replaced 
respectively by Fa, Gs, Xa, Ys. On the other hand, with reference to the 
distributions of X and Y, we employ the notation of section 4 without modifica- 
tion. According to the work in that section, ['(u) is given by (4.9) and its c.f. 
y(t) is given by (4.10). Also, 


Yaa(t) = fa(t)gs(—2t) + fa(—t)gp (zt). 


But it is an immediate consequence of our hypotheses that lim f.(t) = f(d), 
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lim g3(é) = g‘(é), and lim g3(t) = g (2), all of the limits being uniform in any 
B— Bo 


finite interval of values of t..° Thus 


(6.2) lim Ya,e(t) = lim lim ye,g(t) = lim lim ya,s(t) = y(0), 
aco aco Boo fo aco 
Bo 


uniformly in any finite interval on the ¢-axis. 
Consider the extreme members of (6.2). It follows immediately from a well- 
known general theorem” that lim T.,s(u) = I'(u) at all continuity points of 


a—0o0,8—>00 


T(u). Then since Ha,s(z) = Ta,s(0) and H(z) = I'(0), we find that 
lim Ha(z) = H(z), 


B-0 


Ze C(M,), 


where M; is the set defined in Lemma 4.3. By Lemma 4.4, the set Mz is the 
set of discontinuity points of H(z), so the equality of the first and last members 
of (6.1) is established at all continuity points of H(z). The uniformity of the 
limit is due to a general property of convergent sequences of d.f.’s; see [1, p. 31]. 

The existence and equivalence to H(z) of each of the iterated limits in (6.1) 
may be established by two consecutive applications of the foregoing argument, 
and by the use of (6.2). We leave the details to the reader. 

It is to be remarked that both H,,s(z) and H(z) can be represented by (4.11), 
provided, of course, that F and G in (4.11) are replaced by F. and Gg in the 
case of H.,, ; thus our theorem essentially states that the order of the double 
limit and the integration is immaterial in this formula. A similar remark 
applies to formula (5.2). 

The reader is reminded ‘that we have tacitly been assuming that the df. of 
any variable appearing in a denominator is continuous at the origin. In case 
G3(y) does not satisfy this condition, but G(y) does satisfy it, and if, as suggested 
in section 2, we consider H,,s(y) to be the d.f. of the conditional distribution of 
Za,s relative to the hypothesis that Ys ~ 0, then it can be shown rather easily 
that Theorem 6.1 remains true with this modified interpretation. But if G(y) 
is discontinuous at the origin, and if H(z) is interpreted as the d.f. of the condi- 
tional distribution, then (6.1) may be no longer true, as can be shown by trivial 
examples. 

Perhaps the most importgnt cases of variable distributions arise in the con- 
sideration of sums of independent chance variables. We accordingly present the 
following synthesis of Theorem 6.1 and a simple case of the Central Limit 
Theorem. 

THEOREM 6.2: Let U,, U2, -+- , be a sequence of identically distributed chance 
variables, each with mean zero and (finite) standard deviation oy , and let Vi, 


10 See [1, p. 30]. 
11 See [1, Theorem 11]. The result needed here is a trivial extension of the theorem 
cited. 
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Vz, °°* , be a sequence of identically distributed chance variables, each with mean 
zero and (finite) standard deviation oy. Furthermore, let the variables U; and V; 
be all independent, 1 = 1, 2,---,j = 1,2,---. If mand n tend to infinity in 
such a way that 


(6.3) lim 2 =k <0, 
mo m 


noo 


then the d.f. of the conditional distribution of the variable 
Wan = 


relative to the hypothesis that the denominator is different from zero, tends uniformly 
to the function 


” kev ov 1 
6.4 Sta} = [ bara arn 
(6.4) o TT ovku? + oz 
For if we let 
Ui + Uc+--- + Um 
Z in ouvm 
- VitVet+---+Ve’ 
ov~/n 
then Winn = VY m/n(ov/ov)Zmn- The Central Limit Theorem [1, Theorem 20] 
states that the d.f.’s of the numerator and denominator of Zm,, each tend to the 


function | (1/+/2n)e”” dt, which is the d.f. of a normal distribution with 
mean zero and variance one. By (3.3), the quotient of two variables, each of 
which has this d.f., has the continuous d.f. H(z) = [ (1/m)[1/(1 + 2”)] de. 


If we let Hm,,(z) denote the d.f. of the conditional distribution of Zm,, , relative 
to the hypothesis that the denominator of Z,,,, is different from zero, then by 
Theorem 6.1, lim Avy»,,.(z) = H(z) uniformly in z. Now the df. of the 


m—o,n—>o 





conditional distribution of Winn is Hmnl~/n/m(ov/ov)w], and because of (6.3) 
and the uniformity of the limit of H»,,(z), this approaches H[k(cy/ov)w]. 
Differentiating the last expression with respect to w, we find that the resulting 
frequency function is equal to J’(w); and this concludes the proof. 

As an application of the theorem, let us consider the following problem. 
From an urn containing white and black balls in the proportion of p to 1 — p, 
we shall make 100 random drawings of a single ball with replacement after each 
drawing. Let Wso,50 be the ratio of the deviation of the number of white balls 
in the first 50 drawings from the expected number, to the deviation of the number 
of white balls in the second 50 drawings from the expected number. What is 
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the approximate value of w for which P{Wso5 2 w|b} = .05, where the 
hypothesis b is that the denominator of Ws0,50 shall be different from zero?” 
To answer this question, we observe that the numerator and denominator of 
Wso,50 can each be expressed as the sum of 50 independent identical chance 
variables, each with mean zero and with variance p(1 — p). Thus according 


to Theorem 6.2, the approximate d.f. of Ws0,50 is 











w i 1 


J(w) = —co 1 + uU* 


















du = : + : arctan w, 
2 





and the required value of w satisfies the equation J(“) — J(w) = .05. The 
solution of this equation (correct to one decimal place) is w = 6.3. 

It is perhaps needless to remark that a study of the error involved in sup- 
posing J(w) to be the d.f. of W»,, in Theorem 6.2, must necessarily precede the 
unreserved acceptance of numerical results obtained by means of that theorem. 


7. Products of chance variables. We conclude this paper with a rather brief 
treatment of the distribution of the product of two chance variables. To pre- 
serve a notation uniform with that of the preceding sections, we shall write the 
product as X = YZ, where the d.f.’s of X, Y, and Z are to be denoted, as before, 
by F(x), G(y), and H(z), respectively. The existence of F(x) is readily proved 
by the methods of section 2. The assumption that P{ Y = 0} = 0 is of course 
unnecessary here, and will be dropped in this section. 

In the continuous case, an argument similar to the one employed in section 3 
will establish the following result: 

THEOREM 7.1: If the joint variable (Y, Z) has the frequency function Wy, 2), 


then 
F(z) = | | “i u(%, 9) av |e 
e— oo — 0 Vv Vv 
= il y (x, “) av | au, 
Lit cam 10 v 


and consequently F(x) is an absolutely continuous function of x. The frequency 
function of the distribution of X exists almost everywhere, and is given by the formula 


(7.1) f(a) = F(z) = [- = ¥(, ) dy = [ “ : | “ (, *) dv. 


In the discontinuous case, with Y and Z independent, we can write X = 
ZY = Z/(1/Y) and use Theorem 4.1 to derive a formula for F(x). We have: 





= P{iX <2} = P{Y ¥O}P{X S2|Y¥ £0} + P{X <2;Y = 0}. 


12 This hypothesis would always be fulfilled in case 50p is not an integer. 
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Excluding for a moment the trivial case in which P{Y ~ 0} = 0, let Gi(y) be 
the d.f. of the conditional distribution of (1/Y) relative to the hypothesis that 
Y~#+0. Then 


G(—-0) +1- a ‘ 0), 
P{Y ¥ O0}Gi(y) = <G(—0), 
G(—-0) —G ( - 0), 


It is to be observed that Gi(y) is continuous at y = 0. Using Theorem 4.1, we 
find that 


re) 0 

P{X <x|Y #0} =G,(0) + I H(2w) dG,(w) — | H(xw — 0) dG,(w). 
So 
PIY £O}P{X <x|Y #0} 


= G(—0) + [_aaual -6(2 - 0) | = [. aew- oa «(2 “ 0) | 


00 0—0 
= G(-0) + | n(2) dG(v) — l a(? - 0) dG(v). 
0+0 v — 00 v 
This equation is trivially true if P{Y # 0} = 0. Also, 


? 


G(0) — G(—0), 


P{X 22; Y =0} =| 


Thus we obtain the following theorem: 
THEOREM 7.2: If Y and Z are independent chance variables with respective d.f.’s 
G(y) and H(z), then the d.f. of their product is given by the formula 


co nin~[La()aon—[u(e-aann 4G 28 


for all values of x. 
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SOME GENERALIZATIONS OF THE LOGARITHMIC MEAN AND OF 
SIMILAR MEANS OF TWO VARIATES WHICH BECOME 
INDETERMINATE WHEN THE TWO VARIATES ARE EQUAL 


By Epwarp L. Dopp 
University of Texas 
1. Introduction. The logarithmic mean m of positive numbers, x and y, as 
given by 
y— y¥—2z 
(1) ni= i y— —_ x= 9 ~ F*_ 
loge y — loge _ log. (y/z) 
° ° e . 1 ° 
is of considerable importance in problems relating to the flow of heat. 
The logarithmic mean arises, moreover, in less technical problems such as the 


following: Given that incomes ¢ in the interval, x S ¢ S y, are distributed with 
frequency inversely proportional to ¢. That is, with k = a positive constant, 


(2) g(t) dt = (k/t) dt 


is the number of individuals with incomes lying between ¢ and ¢ + dt. Then, 
with x > 0, the total number f of individual incomes is 


(3) f= [ g(t) dt = k(log y — log x). 


The combined income g of the group is 
y 
(4) ¢* / to(t) dt = k(y — 2). 


And thus the logarithmic mean g/f of the two numbers z and y in (1) is the 
arithmetic mean of all the incomes; that is, the average income—at least to a 
close approximation if the group is large enough that integration may replace 
summation. 

Now m in (1) becomes indeterminate, if x = y. Nevertheless, if ¢ > 0, and 
xr—candy—c,thenm-—c. Thus, we may properly speak of m as a mean of 
these two variates, x and y. 

This logarithmic mean is one of a set of means studied by Renzo Cisbani’, the 
general form being 


1See Walker, Lewis, and McAdams, Principles of Chemical Engineering, McGraw Hill & 
Co., Part IV, Logarithmic mean temperature difference. 
2R. Cisbani, “Contributi alla teoria delle medie.’’ Metron, Vol. 13(1938), pp. 23-34. 
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ptte — r+j 1/z 
felted 


and the logarithmic mean appearing when x = 1, 7 > 0. 

In a chart between pages 28 and 29 Cisbani exhibits thirty varieties of these 
means (5). It will be noticed that z is indeterminate if a = b. 

Some methods for dealing with means which may become indeterminate 
forms I have indicated in a recent paper.® 

Now a generalization from a mean of two variates to a mean of three or more 
variates may sometimes seem to be immediate. However, for the arithmetic 
mean (x + y)/2 of two variates x and y, the function [min. (zx, y, z) + max. 
(x, y, 2)|/2 is as much a generalization as is the arithmetic mean (x + y + z)/3. 
Actually, the direction in which generalization is to take place is arbitrary. 
However, it is natural to expect the generalization to arise from a problem 
somewhat similar to one that may give rise to the original mean. And it is 
desirable that to the generalization should be carried over as many properties 
or characteristics of the original as is possible. 

In the foregoing illustration, we considered a single interval x S ¢ S y in 
which incomes are distributed in accordance with a relative frequency propor- 
tional to @(t). And the arithmetic mean of all these incomes was obtained as a 
logarithmic mean of the two range limits x and y, at least approximately, allow- 
ing integration to take the place of summation. If @(¢) had been kt *”, instead 
of kt’, then the average of all the incomes would have been the geometric mean 
of the two range limits x and y. 

To effect a first generalization, we shall now suppose an original interval 2 to 
t,, to be divided into n subintervals by points 2, such that 


(6) Zo <x es <M <M +++ S Sau < Be. 


For each subinterval x,_; to x, the same function ¢(t) will be used to describe 
the relative frequency; but the total population for this subinterval will be con- 
trolled by a positive constant k, , in general different for the different subintervals. 
This may be described as stratification. To make this more concrete, let us 
suppose, as before, that ¢(t) = k/t. Then, with 2 > 0, the mean M, which 
will be described more in detail in the next section, will take the form 


(7) u= aa k(t, — Xr) . 
Dt &; log (x,/a+-1) 

Applied to incomes, M would, like m in (1), give average income. To get 
some idea of the significance of k, , let us imagine that in some community there 
are f, individuals in the income bracket z,; to z,, say from $1001 to $2000. 
Let us suppose now that f, other individuals with incomes between $1001 and 
$2000 distributed in exactly the same manner move into this same community. 


3**The substitutive mean and certain subclasses of this general mean.’’ Annals of Math. 
Stat., Vol. 11(1940), pp. 163-176. See p. 171. 
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Then k, would be changed to k; = 2k,. But, of course, among the entire 2f, 
individuals the relative distribution of incomes is exactly the same as among the 
original f, individuals. 

In this interpretation k, is a weight for a bracket of items. But, taking M in 
(7) just as it stands, k, is the weight for the consecutive pair of numbers x, 
and z,. 


2. The first generalization. When ¢ is in some interval, J = (a, a’), finite or 
infinite, let #(¢) be a non-negative, integrable function of ¢. 

And in J let the points at which ¢(¢) = 0, if any, form a null-set. Then, with 
tin I, write 


(8) (i) = [ o(t) dt. 


And, supposing that in (6), a < 2, Qn < a’, set 
(9) f-= | g(t) dt = B(x,) — &(zx,_1); r= 1,2, ---,n. 
Tr—1 


Then f, > 0; since ¢(t) > O and is continuous almost everywhere in (z,_;, 
x,). Since in any finite subinterval of J, to(t) is integrable, we may set 


(10) u()) = | “Wd dt = | “ tp(t) dt. 


(11) g= | vat = We) — We). 


Now, by a mean value theorem, there exists a number t, such that 
(12) g-/f- = t., 0th Ot. 


Taking positive numbers k,, the weighted arithmetic mean of g,/f,, with 
weights k,f, is then 


(13) M= DI k,9r _ py kf[W(a2-) — V(a,_1)] 


te = Ot kee) — ee) 
If ¢(t) = k/t, this becomes the mean (7) associated with the logarithmic 


mean. Now, since for (13) the weights k,f, are positive, it follows from (12) 
that 


(14) m<tsMst, <a. 


Suppose, now, that b lies in J, and that subject to (6) each z, — b. Then, 
by (14), M@—b. And thus M is an internal mean of x, 21, +++ , Xn, although 
with the z’s all equal, M@ assumes an indeterminate form. 

In (13) the weights k, are applied to pairs of numbers, either to V(x,) — Y(a,-1) 
or to ®(z,) — ®(zx,_1), whereas in most weighted means, the weights are applied 
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to individual numbers. We consider now a form equivalent to (13), but in 
which the weights c, are attached to the individual numbers. It seemed possible 
to get a more general mean than (13) by abandoning certain conditions upon 
the weights c, which first arose. But such relaxing of restrictions leads to diffi- 
culties, as will be shown. By setting 


(15) oO = —kh, CC, = ky ; Cry = ke — kr, 
we may write M in the form; 
(16) M = 28 V(t) 
D0 Cr &(2;) 
On the other hand, if we choose c’s subject to 
(17) co < 0, C<— (otat-:--- +e.) forO<r<n, 
(18) = —- Tie: 


then positive k’s can be found to pass from (16) back to (13). 
The question arises whether if the conditions (17) are abandoned, and with 
the c, not all zero, (18) is retained as 


(19) > Cr = 0; Some c, ¥ 0, 


M in (16) will continue to be a mean of x, 71, --- , 2n, possibly, an externa] 
mean. 


It may be noted that the condition > c, = 0 arises from the fact that when 
parentheses are removed from (13), each k, is matched by —k,. 

By an example, it will be shown that under (19) alone, M in (16) may fail 
to be a mean. In (8) and (10) takea = 0. Then with n = 2, ¢(t) = t, take 
¢ = 1,c, = —2,c = 1in (16). Then 


ro — Qari + 23 
20 wa 2S. 
” 2(x%o — 241 + 22) 


Ifb > 0,€ = % — b, n = 2 — b, andé = 22 — b, then 


e 1é—27 +2# 


If now n = 2e, and é = 3e + €e, then 
(22) M =b+ (2+ 6+ &€)/2>b +1, as e— 0. 


Since M does not approach 6b here, when 2», x; , and x2 — b, in the manner 
specified, M in (20) is not a mean of 2, 21, and 2». 

We may enquire, further, whether the function M in (16) could be a mean if, 
discarding (13), (17) and (18), we put upon c, the single restriction c, > 0. In 
that case, if 7 < t < z,, then, since (t) and W(t) are continuous functions of 
t—see (8), (10)—it would follow that if each z, — t, then M — V(t)/@(t). But 
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if M is to be a mean of 2%, 2%1,--- ,2,, then M —twheneachz,—t. Thus 
we are led to W(t) = t@(t). Except possibly for points of a null set, ®(¢) and V(t) 
have derivatives ¢(¢) and y(t); and thus 


(23) v(t) = W(t) = t@'(t) + Ot) = tp(t) + O(0). 


But then, since y(t) = to(t)—see (10)—it would follow that (tf) = 0 almost 
everywhere in J; but ®(f) > 0, if f > a. Hence the assumption c, > 0 is not 
sufficient to make the function in (16) a mean of x, 71, -°--, 2n. 

In the simple case of n = 1, M becomes 


— Yair) — Yao) . 
(24) a? @(x1) — B(x») ’ 


and this is a symmetrical function of x and 2. 
The question arises whether if n > 1, M in (13) or (16) can be a symmetrical 
function of 2,21, °°: ,2,. Assume, if possible, that with z < y < z, 


. og) = Coa) + a ¥(y) + 2 ¥(2) 
™ Ph © aa + ate eee 


is a symmetrical function of x, y and z. Now if a/b = c/d, and b — d #¥ 0, it 
is well known that a/b = (a — c)/(b — a). 
Hence, if H(z, y, z) = H(z, y, x), and c) ¥ @, then 
(co — ce) (W(x) — W(z)] 
26 H(z, y,z) = — ————————7 
- 2 = = @)[6@) — 6@)] 


which is not symmetrical in the three variables. Then H is not symmetrical 
in x, y and z, unless, possibly, when cy = ¢2. 

Likewise from H(z, y, z) = H(z, z, y), we are led to the conclusion that H 
is not a symmetrical function of x, y, and z, unless possibly when ¢; = ¢. But 
Co = Cy = & substituted into (15) makes kj = ke = 0, which is contrary to 
hypothesis that k, > 0. Then in (25) the constants cp, ¢: and c: can not be 
chosen in conformity with (15) so as to make H(z, y, z) a symmetrical function 
of the three variables. 

Symmetry in two variables will appear, however, if the mean (13) reduces 
to a mean of just two variables as it does when each k, = k, constant, in which 
case, 


= V(x) — V(x) 
~ &(@,) = @@) 


(27) M 


Although in the generalization (13) symmetry is thus lost, another property, 
homogeneity is retained in what seem to be the most important cases. 

Most means Q(z, y, --- , w) in common use are homogeneous functions of their 
arguments. That is, if cis a constant, and Q(z, y, --- , w) and Q(cz, cy, «++ , cw) 
are both defined when z, y, --- , w lie in some interval J, then 
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(28) Q(ca, CY,*"", cw) = cQ(z, — « w). 


This homogeneity is associated geometrically with ruled surfaces, in particular 
with cones. 
With reference to (8) and (10), let us write 


_ Vy) — ¥(2) 
P(y) — B(x) 


(29) F(z, y) 


And now, let us consider a special variety of means obtained by taking in (8) 
(30) g(t) = t, 


where g is any real number. Then F(z, y) is a homogeneous mean; that is, 
(31) F(cx, cy) = cF(z, y). 


This is valid, indeed, even in the special cases, q = 0, —1, and —2, which lead, 
respectively to the arithmetic mean, the logarithmic mean (1) and to a second 
variety of logarithmic mean 


(32) m = ty 108 y/2) 
y—-x ’ 


exhibited by Cisbani. It may be noted that q = —3/2 leads to the geometric 
mean, and g = —3 to the harmonic mean of z and y. 

It is conceivable that for ¢(t) other functions than ¢*—functions not equivalent 
to ¢* in integration—might be used to lead to a homogeneous F(z, y) in (29). 
But such functions, if any, would hardly seem to be in common use. 

The M in (13) retains the property of homogeneity, at least for ¢(t) = t*; 
and so will also the more general means exhibited in the next section. 


3. Further generalization. The means of Cisbani (5) suggest the following 
generalization. Let p be an integer or the reciprocal of an odd integer. With 
the notation of (13), take k, > 0, and 


(33) F,= Dikf?, Ge=Likdg?, 

(34) M, = (G,/F,]'”. 

Indeed, if in (8) and (10), a = 0, then g, > 0; and we may take for p any real 
number except zero. Now, M% may be described as the weighted arithmetic 


mean of (g,/f,)” with positive weights k,f? . And hence M, is an internal mean 
Of 2, %1,°°*, Ln; that is 


(35) % & M, S&S 20- 


Pp 


Furthermore, if in (8), ¢(¢) = ¢*, where g is any real number, then M, is a 
homogeneous mean of 2, %1, °** , In- 
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Another generalization may be obtained by writing 
(36) m, = g-/f, 
(37) M, = [Zk.m? />k,]'”. 
And still another 


— kn l/Zk 
(38) M" = [mi'-m2? --- m,"]'*"’. 


n 


These means (37) and (38) are internal; and they are homogeneous, if F(z, y) 
in (29) is homogeneous. 

The foregoing means are not, for n > 1, symmetrical functions of 
21, %2,°*+,2%n. Nowthe mere abandonment of (6) may lead to functions like 
(20) which are not means at all. But symmetry may be introduced as follows. 
First, lay aside (6), but suppose that the z, are all different. Then let 


(39) Ses ea | . (t) dt, grs = | | tp(t) dt; 


Tr 


where r = 0,1, ---, (rm —1);r <s Sn. Then, let 


(40) Uz Ro V= 20°42; 


where U and V is each a sum of n(n — 1)/2 terms: Let W be the double-valued 
mean 


(41) W = +[V/U]”. 


Then W is a symmetric function of 2% ,271,---,2,. If, in (8), a’ S 0, then 
in (12) each g,/f, < 0; and in (41) the negative value of W is an internal mean. 
But the positive radical is external. On the other hand, if a 2 0; then g,/f, > 0; 
and the positive radical in (41) is internal. In this case, it may be well to use 
for W only the positive value of W. 

In the more general case where a < 0 anda’ > 0, the fractions g,/f, may have 
different signs. But, in all cases, at least one of the two radicals (41) is an 
internal mean of 2%, %1,°-:, %n. Moreover, W is homogeneous, if in (8), 
¢(t) = t’. 

Finally, let 


(42) tee Gralbeas 
(43) Z = +{[2mi.,.]/n(n — 1)}"”. 


Then Z is symmetric; and at least one value is internal. If a > 0, we would 
naturally take Z > 0; and this Z is then an internal mean. Moreover, Z is 
homogeneous if the m,,, are homogeneous; that is, if F(x, y) in (29) is homogene- 
ous for every x and y in J. 
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A STUDY OF R. A. FISHER’S z DISTRIBUTION AND THE RELATED 
F DISTRIBUTION’ 


By Leo A. AROIAN 
Hunter College 


1. Nature of the problem. Consider two samples of N; and Ne drawings, 
each sample drawn from one of two populations consisting of variates normally 
distributed with equal population variances o°. We define the two sample 

Ni N2 
Ze Lj 
means #, = *—— , 2 = a ; 
Ni No 
from the two samples 


x;’s and z;’s independent variates. We calculate 


N2 
/ a (x; — a2)” 
YS — m = N,-—1, m= Ne-1. 
Ne 
42 
The distribution of z = 3 log 7 is well known. 


Se 
2ni™ n}” "diol 
(1.1) P() = ——— —_——— 
B m4 9) (nye + nq)* +2) 
2’2 
We shall denote the ordinates by y(z). The purpose of this study is to discuss 
the seminvariants of the z distribution and also to find useful approximations 
for them; to show that as n; and ne approach infinity in any manner whatever 
the distribution of z approaches normality; to find the upper bound of the ab- 
solute value of the difference between the distribution function of z and the 
function determined by the approximate seminvariants of the distribution of z 
for n; and ne large; to approximate the z distribution by the Type III distribu- 
tion, the Gram-Charlier Type A series, and the logarithmic frequency curve; 
and finally to investigate the same properties with respect to the F distribution, 
2 
where F = e” = = . The non-existence of the moments of F for certain values 
S82 
of n; and m, is noted and explained on the basis of the distribution of the quo- 


tient y s 
x 


1 Presented to the American Mathematical Society, September 10, 1938, New York City 
in part; and to the Institute December 27, 1939 at Philadelphia. 
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2. General features of the z distribution. The z distribution is always uni- 
modal, asymmetrical if ny # ne, and symmetrical if ny = ne. We see that 
interchanging n; and ne is the same as replacing z by —z. Fisher [7] noted that 
the two parameter family of curves includes as special cases the ae curve, 
the x” distribution, and Student’s distribution. The mode is at z = = Q, Pe 
maximum ordinate is 


2ni™ ni 


y(0) = = — (ny a Ns a (ny +n) 


Nn, Ne 
o(3.5) 
or approximately 


1 4 
(2.1) y(0) = ak 13(2 + > for n; and nz large. 


The two points of inflection are 


(2.2) z=} log au + m+n V ni +n! + Qn? no + Qn, n2 + 2nina\, 


Ny No ) 


They are equidistant from the mode, a property also of the Pearson system of 
‘ d" Y\Z 
frequency curves [24]. Also lim 2” — y(2) 


gis dz n 


= 0. 


3. The moment generating function and seminvariants. The moment gen- 
erating function of the z distribution is 


o—-O0n+86 , (2 — 6 {m+ 0 
no \** B(™ =_ * = n yo I (* Q- ) (“3 ) 
3.1) M.(6) = = a = 7 ae _4 
( ) ( ) = B (%, = Ny r(’ ‘) r & 
22 2 2 


The seminvariants of Thiele are defined by the following identity in 6: 


9° 4 
(3.2) log M.(0) = 18 + v5 se mtMaton 


To find \, we take the logarithm of the moment generating function, expand it 


r 


' . 6 ; : 
in powers of @ and choose the coefficient of 3 A complete discussion of proper- 
r! 


ties of seminvariants may be found elsewhere [4]. 
4. The seminvariants of z. Now by the following formulas [11] p. 38: 


2 
(4.1) log T(1 +2) = +> a 4% - nes le| <1, 
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2 3 4 
(4.2) log M1 — 2) = a2 + + > += 


9 7 


where in both formulas 


n> 


s 1 1 1 
81 lim (1+ e+ 2434-4 


1 1 1 1 
pra tg tet 


Sn 


2 


log B(s{1 + 2], 3) = log x — oz + 02 5 — 


where 


Hence from (4.1) and (4.3) 


1 . 2 
log r ( £2) = slope —2(a +2) +5 (a+ 2) 


x s a 8 
~F (ate) +2 (at H)—- 


) Sn, Nn 2 2, we may write (4.4) as 


(4.4) 


1 
Ont 


flita\_, a 81 = cr _ *) 
(4.5) tog (1E*) = 3 tog r(n+%) +d ke 1 DF Sk. 


From (3.1) 


log M.(0) = log (* - ’) + log T (at) 


0 
+ 5 (log no — log ni) — log T (3) — log I (") A 


The results assume slightly different forms for (A) mn; and nz each even; (B) m1 
and nz each odd; (C) n; even, nz odd; (D) n; odd, ne even. The general formula 
for \,:2 for all cases is 


Since o, = (1 


(4.6) 
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. _ we f(-)'(r - 1)! (r — 1)! \ 
(4.7) Arse =_ dX (pee > 2k)* + (nz + 2k)" ’ 


This result is not so useful from the point of view of numerical applications as 
the formulas which follow. 


5. Case A, nm and n. each even. From (4.6) 


Ne—O0\ _ NM —2-—6 
log 1 ( 9 *) = log (maz 


5 2 a + 
(5.1) 
+ log (1 _ ‘) + log T (1 - 
+ 6 P - ‘ Ne ° 
Now log {1 — — There will be — — 1 series of 
mm k=1 k No _ 2 2 


co k 
this sort, and only one series of the type log T ( — ‘) =») > (°) as given by 
k=1 


(4.1). In the above expansion and those succeeding, terms not involving @ are 
omitted, since such terms are not needed in finding the seminvariants of z. The 


series log T ( _ 4 will always occur. Then 


ne— 
log T ( 9 


(5.2) 


or 


m—0\ si f0\ _ E (§) 
(5.3) og  (™>*) = F (8) - Oi & (g)- 


We remark that the double sum is zero if nm. = 2. Similarly 
m+0\  &(-1)*' ( 9 ) ( .) 7 
log (™F) = k m —2 r m— 4 . 
6\* 6\* 
* (2) “ (2) } 
or 


5) dogr(ME8) = FSD (AY 4 BDE MER (aN 


k=1 t=1 2l 


By use of (5.3) and (5.5) we have for the seminvariants of z, when mn; and m 


are even 
dno—1 1 tni—1 1 
: (s.- 2 1) + (-17 (s. - Z YI, r<| 2. 
i kr bat KP 


(5.4) 
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For \i:2 = Z we have by (4.6), (4.3), and (4.5) 


—— 1 in y—l 1 
(5.7) ~* >| (108 mo 2 ) - (1 "2 } 


k=1 


6. Case B, n; and m. odd. We have 


log F (* =) — (= — + log = ea es 
L-¢ 1-6 
+ log ( a ) + log r( 9 } 


(6.1) 


Expanding log T (? a 


h 


ne — 0 a a" 
log ("25 ) Em — 2 t km — ait +¢ 


8 g* 1 
+ sil *) + a (1 - z:) 8 


However “(1 = i)=} +3 tz +a ; + -+-+:,k > 1, which we shall denote 


hereafter by t.. Hence an sain 


(6.2) 


aa 6 = 81 e 6° _— 
63) tog (5%) a o(nt 2) + ery 


Also 


log iy (* #8 = log (: te?) 


(6.4) 


and 





(6.5) 


(6.6) 


k—-1 l=}(n1—3) g* 


se ~ al fete 


k=? 1=0 (21 + 1) +1)*. 
Combining both these results (6.3) and (6.6) we have 





LEO A. AROIAN 


( 3(n2—3) 1 ) 
~= _ ! — en 
Ap:z (r my t, = (2k + 1)" 


i $(n1—3) 1 
+> (—1) (« = p> area): r 2. 


1 5 (n2—3) 1 1 3(n1—3) 1 
(6.8) Au =2= ( lognz— >, ) — ( logn— > }. 


i «(2k & 3 2 k=0 2k +1 


7. Cases C, D, and values of s;, 0, , ¢. The formulas for case C, n, even, 
nm, odd are 


( i (n2—3 r $ni—1 
(7.1) ee = (= DI (t = = _! + (s.- * ty}, a 


fo «(2k +1) 2" f= kr/) 
ini—1l 4(n2—3) 1 


- l 
k 2 aei** 


k= 


The results for case D, n; odd, ne even are 


(4 ino—1 y §(n1—3) 1 
= ez (r— 12 . — me i \ : , 
73) me=-DHE(— DB p)H(-"(“- Laka}? 


§(ni1—3) 1 1 - 1 
se & 


1 Ne 
i) theme tee 2 
(7 ) l:z 2 0g nm ot a Ak + 1 k= k 


We list the numerical values of s, and t, k < 10. The values of s; are from 
Stieltjes [20], 


(7.5) 8s, = 0.57721 56649 (7.6) = log 2 = 0.69317 0206 
Se 1.64493 40668 1.23370 00550 
83 1.20205 69032 1.05179 97903 
S4 1.08232 32337 1.01467 80316 
8 = 1.03692 77551 = 1.00452 37628 


S| = 1.01734 30620 = 1.00144 70767 
S7 1.00834 92774 = 1.00047 15487 
Sg 1.00407 73562 1.00015 51790 
Sg 1.00200 83928 1.00005 13452 
Sio = 1.00099 45751 ° 1.00001 70413 


By means of the formula & = a - 5}. k > 1, t& was calculated from . 


From the well known results for the Zeta function of Riemann ¢(s), [22], (p. 265, 
p. 267), 


" “1 1 f* a *e* . 
(7.7) fi =s.= >d- =o l ek s ¥ b> i. 


k=l ks 
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1 ; f « 
"7 s => _— . § «aa | ‘ccna 1 
(7.8) _ ( zs) (s) T'(s) Jo em +1 = 


(79) t= ¢(s) ¢ - +): 


8. The mean of the z distribution. From our previous formulas for Z we 
prove that if m. = m,Z = 0, andz < Oforn, > m,2 > O0form > nm. The 
maximum absolute value of ;:. will occur when n; = 1, ng = ©, orn; = ©, m = 1, 


and from (7.4) or (6.8) we have max | \;:. | = 5 + $ log 2 = .6352. 


9. Formulas for )o:2 5 Me:2 5 As:z 5 Maz y Ad:z » ANG ws:,. We have four cases from 
(5.6), (6.7), (7.1), (7.3): 


$(n1—2) $(n2—-2) 
E — . — 1 
as mi (O 

(9.1) 


= 822467 — — 4 : nN, Ne even. 
k= c” — - 


1 (3mx®) ‘ ) 
( es .467401 — - manne Bl 1, M2 odd. 
(9.2) Ye: = 2.46740 i( a +9 é - ; Ny, N20 


$(ni—1) 3(mo—e 
(9.3) Aee = 1.644934 — r( - 1 é 
‘\ a # 
: . , 
(k > 3)? k=1 


In all cases of course de., > 0 and moreover dy:. ~ 0 asn,andnze—> ©. We list 


1 §(n1—3) 
(9.4) ez = 1.644934 — ( 
4 k=0 


tno—l 1 


ni-— 
(9.5) : == >» ™ 1, No even. 
5 : k=1 


(9.6) 3 = — aay, Ny, Ne odd. 
(9.7) ; : ——, n, even, Nn odd. 


. 1 
(9.8) — 1.803085 + i O50 - i. : n; odd, n2 even. 


ino—l 


(9.9) ' 811742 — = 5) Ny, Ne even. 


(n 4(n1—3) 1 


Qk+ii? & e+ 


k=1 


3 
(9.10) YAa:e = 12.17614 — 6( 


) N,, N2 odd. 
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$(n2—3) 1 $ni—l 1 


+ pe ) ny even, Ne odd. 


9.11) Ag:. = 6.493939 — 6 2 
( ) : 6( imo ©6((2K + 1)4 im ik 


ina—2 4 }(n1—3) 1 
9.12) ra:2 = 6.493939 — 6 : a 
ee ( 2 k4 7 zo (2k 4+ 1)! 
We see \,:, > 0 whenever riseven. If ris odd X,.. < 0 if m2 > nm, and ,., > 0 
ifm >nz. Also y,:.>0, m1 > nz, r odd, greater than one. Similarly u,., < 0, 
rodd > 1,m>m. 


), n, odd, nz even. 


10. Skewness, excess, and values of a,. We take for our measure of skew- 
bs As , . ; 
ness a3 = ua? = aR: For m2 > mn, a3 < 0. Further the skewness increases 
negatively if nm; remains constant as m2— ©. Thus negative skewness will be a 
maximum for m2 = ©, n; = 1, and positive skewness will be a maximum when 
no = 1,n,; = ~. The absolute value of maximum a; is 


rt: 
(10.1) las | = 1 = 1.5351. 


' M4 M4 : 
As our measure of kurtosis we use ag = 3 = 3 + x2 As a measure of excess, 
2 


Me 
Ag 


E, we use EF = a — 3 = x2: 


The excess is always positive. 


11. Approximations for \,:.. by the Euler-Maclaurin sum formula. The exact 
results given previously for the seminvariants become unwieldy for n; and n 
large. Hence we develop useful approximations for the seminvariants, and give 
the maximum error of the approximation. We find first our results for ),., 
when 7; and v2 are even andr > 1. We begin with (5.6) 


ey —s x ‘( -'¥ a) 
Ane = 3 8 Ze ge) t(-D' le — 2y pe) 


and rewrite this as 


(r—1)!f G1 7 oe li 
11.1 A:z2 = ——— — —1 — >, 
em 2" re, kr 7-8 Jn, kr J 
Now find the two sums of (11.1) by the Euler-Maclaurin sum formula [21] 
using the first three terms, and obtain 


— (r 5 (a = 1 4 (~~ Sts ‘) 


No ny 


+d ( 24+) 


3 ne ny 


r+3 + +3 


Ne ny 


45 


_ rr = Yr + YO + BY 1 —- 
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We use the following theorem [10] (p. 539), to find the error: 

If f(x) is of constant sign for x > 0, and together with all of its derivatives, 
tends monotonely to zero as x > «, Euler’s summation formula may be stated 
in the simplified form 


Si= [Watt o+Bn-wt-- 


z=0 


(-—1)"" ' Box (2k—1) (2-1) OBon+2 (2k-+1) (2k-+1) 

Qk)! (f; ‘) + Gk +a -f, °°) 
where 0 me <i and By => 1/6, B, = 1/30, B, => 1/42, Bs = 1/30, By = 5/66, 
ete. If we use 


_ (r—2)! m+r-1 mt+r—1 
(11.3) ns = (” Ee Ena frat), 


ns Ny 


then the error committed is of the same sign and less than 
rtf 1 (—1) t 
ae: ae 
3! \nzt nit 
If we take 
an ! i —— a 
dre = ESN (MtLH 1 y (ay mtent) 


2 Ne ny 


r(r — 1) € (~aF )| 
7 3 nit) + “ | 


then our error is less than, and has the same sign as 


_(r+2)!f 1 =) 


~i 5 
90 No” nit 


(11.4) 


Finally if we use (11.2), our error has the same sign as, and is less than 


cre (—1)"| 
945 nits nits J : 


12. Approximations for other values of n; and m.,7 > 1. Now in case 
and n2 are odd we have from (6.7) 


(12.1) ee = r= D1 ys 


< 1 
sates, toil ee 
k=}(n2—1) (2k + 1)" k= ee (2k + 1)’ 
Applying the Euler-Maclaurin sum formula to each of the sums in (12.1) we 
are led to exactly the same results given in paragraph (11). The other cases 
are obvious combinations of the sums in (11.1) and (12.1), and so for all values 
of mn; and n2 the approximate results for ),:., 7 > 1 are 
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nee = Cet et 4 (ay mtr} 
2 Ne ni 


(12.2) ( 
arf ld . yy _ @+2)!f 1, (17 
6 \nt* “ 90 " nits |' 
Formulas (11.1) and (12.1) prove the result previously given for X,-:2 (4.7). 


13. The approximate values of \;... From (5.7) 


1 tno—1 1 ini—l 
Ais = = | (toe Ne — 7 :) (tog nm — Z. |, nN; aNd No even. 
2 k=1 k k=1 k 


We use the Euler-Maclaurin sum formula on the sum 
tnog—1 tno—1 
E i-{E(3.)- 2\ 
k= k \ ko \E +) N2) 


and the similar sum involved in );.,. Hence we have 


1/1 1 | 1/1 1 1/1 1 
13.1) Auz = el——- 7 +-(—--—)-~—| - : » Me > 2. 
' , - (. "1, 6 € ) 15 (5 3 em 


The errors committed by using one, two, or three terms of (13.1) are less than, 
and of the same sign respectively as 


G) -aGe3 8-3 
6\nz n?/’ 15 \ni nt]? 63 \n$—n8/ > 


For n; and nz both odd we find the same result as (13.1). The restriction n,, 
Nn, > 2, may easily be replaced by n;, m2 = 2 (for m1, ne even) and n,, nm 2 1 
(for 2; , Nz both odd). When 7; is odd, ne even, the formula is again the same 
as (13.1) if m; and ne are sufficiently large; but if m: and ne are small we find 
in this case 


1/1 1 1/1 1 1/1 1 
a es ae Be ae a 
oi (7. *) ° 6 (", *) 15 (7. ‘) 
1 1 1 1 i 1 1 
a” ” —- = ——])]-.- 2. 
T 5 (1 2 T 6 (1 ) 15 (: i) 9 18 


Another method of finding (12.2) would have been to use the asymptotic ex- 
pression for log I'(z). 


(13.2) 


14. Approximate values of i,:, for values of 7. We list the approximate 
values of X,:, to three terms. 


1/1 1 1/l 1 1 1 1 
Mz = ; oa =i 3) ~ sel 7 
2 (2 *) ™ 6 ( ‘) 15 (4 *) 
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m+1 1/1 ) ) 

‘tiene Z ale + ange seni a aa 

: ns n? : ni 

+ 3 
083) aa(ed) abe) 
no Ny ne ny 

der = 3 (= re =) + 20 o(4,- ) = 4 
No ny No ny Ne Ny 


res = 12(™ 45 445) 4 r00(1 4 1) -a18(44 2). 


Ne ny Ne ny Ne ny 


The approximate values given by Cornish and Fisher [8] (p. 319), are similar, 
but have fewer terms. Cornish and Fisher give no remainder term. From 
(14.1) and (12.2) we see the maximum absolute values of do-41:2, 7 2 1, occur 
when m2 = “©, = lorm = 1,m, = &. Similarly d2,:., 7 2 1, has its maxi- 
mum value for n; = nm. = 1. The standard seminvariants of z are defined 


;.,7 2 2. We also note that for no > 1, &41:. < 0,7 2 1 and hence 


; Mn ; 
ar+1 < 0 also where a, = —~. Moreover the maximum absolute values of 


bor., aNd &731:2 occur When n; = 1, m2 = © Or ne = 1, mn = ©; and also for ae, 
and a,4;. Approximately then 


(14.2) max é,:, = (—1)’ _ Aa 


. 6%, vy 
The exact value for maximum a,:, is 3 + — = 7.07. 


to 


15. Approach to normality of the z distribution. We prove the theorem: The 
distribution of z approaches normality as n; and nz — © in any manner what- 
ever, with Z = (> — 1) 0, = (2 4- 4 . We also find an upper bound 
of the absolute value of the difference between the z distribution and the func- 
tion determined by the approximate seminvariants of z when n; and nz become 
large. To prove the theorem we start with the original distribution of z, and 
find when n; and n are large, 


1 ma tne hrm? 
15.1 P(z) = —— { ; a 
waa «/Se @ a, \me* + no) . 


We change to standard units z = to, + 2, then 
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] Pree nyto+ny2z 
(15.2) P(t) = Tz = aes \ eet de = ww StS w, 
T Ne 
We rewrite this as 
1 . $(ny+ngQ) 
(15.3) P= ; 2 (to +2)/( a “—2ny (to+2)/ (ny + =f dt. 
/ 20 | 2no + ny+ne +. Ne e 2n 4 (to+zZ)/ n1- no 


2ng(to+z2)/(ny+ng) —2n, (to+zZ)/(ny+ng) 


Expand nye and nee and add term by term. Divide 
this result by n: + m2 from the numerator of P(t) to obtain 


2ni nto + z)° ‘7 1 \ 
15.4 1 a 
( ) * (n: + n2)? ? (ni + n2)?f * 


Hence 


27 no(to + 2)° iy (ni tne) 
1 it. 
i | 7 (m1 +m)? f ¢ 


We evaluate (15.5) for n; and ne large by using logarithms. 


- ny x Neo log fi4 2n no(to -b * 


(15.5) P(t) = 





(m +m)? | 
_ _ mtn I J2ninelto + z)"\ 1 [2nyno(to - y\? 
2 L(mtm? f 2. mtm? | 








= (—1)"" Sell 
+ 2, -— 1 To oy. 


This gives 


ae > (Po + 2toz + 2") + nin tie + 3)! + > (--1)' {2n1 no(to + ayy 





(ni + ne)? r=3 2r(mi + ne)" 
We reduce this then to 
- t — gt (Z0— oy ih 1 { Qni ns (to + 2)* 
2 2 2 \(m + m)2) M1 + Ne 
+ terms involved in the above summation. Let U = o |Z < co. Since 
, ; —_ a U* 
lim o=0, lim U=0. ‘Semularly lim *“° = lim =(. Con- 
n1,nNQg—7>o n1,nQ9-%o n1nQg—- N1,nNQg—7>eo 
3.4% r\4 
: 9 *(t 
jie — (tc + 2)* = ~s +2) (t+ U) . Hence lim (t+ UY 
(ny + n2)* a. + m) ~ A(m + ne) nisne-veo 4(1 + Ma) 


0. In like fashion 


. (—t7 f 2n1 N2 y (to + 2)” a i o * (ta + 2)” 
( 


r=3 \m + ne (mi +m) 23 Ar(m +m) 


Now clearly from our previous discussion for r = 2, we see 










de 


ee 
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: oo (—1)’ o "(te + z)" 

lim i Ml a. as 
N1,NQ—70 2d, 2r (m ae N2)*1 
This completes the proof. 


We now consider the function, f(z), determined by the approximate semin- 
variants of z. We start with 


Auz = at —_ ) and Anz = — {2 2 = * (— 1)’ mtrat}, r> i 


N2 MN n> ny 





from (12.2) using only the first term. We may easily prove then that as 1 
and m2 approach infinity in any manner whatever the function f(z) represents 
a normal frequency distribution with 


a 1 1 a>? m+1 
a5 a) and weep (™Et + MEY), 


This further shows the identity of f(z) and y(z) in the limit as n; and n.— . 
Since the moment generating function of f(z) is 


4(ng—1—6) 3(ny—148) 
a 
Neo ny 
we have 
6 4(n2—1—10) -a\ 3(n1—1448) 
(15.6) fe)= 2 [oe “(1 -*) 1+° dé. 
2 1 


I have not been able to evaluate (15.6). We instead shall find an upper bound 
to the difference | f(z) — y(z) | as m1 and n become large. We form f(z) — y(z). 
Then by use of Stirling’s formula for n! with the remainder term and by the 
Fourier Integral Theorem, 


(15.7) | f(z) — y(z) | S (e2/0"*8s/"2 _ 1)y(z) where 0 < Bs <1,0<&, <1, 
and 


(15.8) lim | f(z) — y(z) | = 0, and for this case f(z) = y(z). 


ny,nQg—7o 
Of course (15.7) furnishes the upper bound of the absolute value between the 
frequency distribution of z and the function determined by the approximate 
seminvariants of z for any values of nm; and ne. 


Up to this point we have assumed that there exists a function determined by 
the seminvariants 


— 2)1/ — 
ues = (+ - 2) and has Se EET ato yetentt > s. 


2 Ne ny no Ny 





This may readily be proved by using the following theorem [18] (p. 536): The 
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determined character of the moments problem for an infinite interval is insured if 


0 eo 

—1/2 . 
>> cp,” diverges (c. = [ 2” ar(2)). 
n=1 I 00 


16. The Pearson types of approximating curve. In discussing the types of 
the Pearson system which may be expected to approximate the z distribution 
we shall use the results of H. C. Carver [1], and the further exposition of C. C, 


° rn . 2 ae 3 a 6 
Craig [3]. To find the Pearson type we compute 6 = «a a a 
Q4 . 


v2 (ny — No) 7 
Vi no(m + ne) 


shall find it convenient to use the approximations a; = 


and 


(ni — Ny Ne + n3) 


Siieensbnes to obtain 
M1 N2(M + Ne) 


a=3-+ 4 


mes) 6 = try 
3n? no + 3n, nz + Qn? — Anne + nj’ 


and consequently 0 < 6 S 3. The only possibilities are Types IV, VII, VI, 
or V since the greatest value of a3 by (14.1) is 2.3565. Now if mn. = n2, we have 
Type VII, since aj = 0,6 > 0. In all other cases we shall have Types IV, V, 
or VI according as a3 < 46(6 + 2), a3 = 46(6 + 2), a3 > 46(5 + 2). We 
neglect &. Hence a; < 86 implies 


ns(ny — 2) + n3(15ni + 61) + n3(15ni — 8ni) 
iene + no(n4 at 6ni) — gni >6 
A simple investigation reveals then the following results: 
Type [V for n, ne = 2, m1 ¥ ne. 
Type [V form, = 1,1 S m S 21; orn, = 1,1 
Type VI for n, = 1, m2 > 22. 
for ne = 1, m > 22. 
Type VII for ny = ne. 
Clearly the z distribution has features comparable to Type IV since both have 


infinite range. However, Type IV is irksome to fit in practice. 


17. The Type III approximating curve, the logarithmic curve, and the 
Gram-Charlier Type A. The criterion for Type III is 6 = 0, a3 # 0. We see 
that as ; and nz increase the value of 6 will decrease. Even for small values 
of nm and n. Type III will furnish a fair approximation to the z distribution. 
For example n; = 10, n2 = 5,6 = .094. The advantage of the Type III approxi- 
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mation rests on the fact that Salvosa’s tables may be used. From the chart in 
[16] since a3 < 2.3565, we are assured that the approximating Type III curve 
is bell shaped. For mn; = 1, 2, m2 = any value, this approximation is not all 
that could be desired, although even in such cases it does have value. We note 
that Type III has limited range at one extreme (- z. ~) while the range of 
Qs 

the z distribution is (—#, «). Salvosa’s tables extend as far as a3 = 1.1, 
and since max a; = 1.5351, we see in some cases, and these only for m = 1, 
n, large, we shall be obliged to make use of Pearson’s Tables of the Incomplete 
Gamma Function [14]. The logarithmic frequency curve 


1 1 u—a\’ 

ne V/ 2x clu — a) — | 2c? (toe —b ) | 
will be useful in approximating the z distribution. While it has been discussed 
by many authors we shall follow Pae-Tsi Yuan [23], where a full bibliography 
may be found. In our discussion we use the 6; = a3, Be = ay chart of the 
Pearson system as given by S. J. Pretorius [16] (p. 147), since the logarithmic 
frequency locus connecting a3 and ay, is already drawn in. The justification of 
this curve for fitting is due to the fact that in the 6, , 62 chart of the Pearson 
system as given by 8S. J. Pretorius [16] (p. 147), the logarithmic frequency locus 
lies in the Type VI region between the Type III locus and the Type V locus, 
and consequently closer to the Type IV region than Type III itself does. Hence 
since Type III fits fairly well under certain conditions and Type IV fits well we 
can expect the same for the logarithmic curve. Furthermore when az is small 
the logarithmic curve is similar to Type III [23] (p. 42), and as a3 becomes 
larger, a3 = 1, the difference between the two types is pronounced. However, 
it is just when a3 becomes large in the region n; = 1, ne = 22 that we find the 
logarithmic curves give a fine fit, since in such cases the point (a3 , 82) lies prac- 
tically on the logarithmic locus [16]. To fit the curve [23] (pp. 37, 48, 49), we 
find the values of the three parameters a, b,c. To find c we solve the equation 
w + 3w” — (4 + a3.) = 0 for w using the table [23] (p. 48) given by Pae-Tsi 
Yuan. Knowing w we can easily solve for 


= (log w)', b= (2+) won, 


3:2 


_ (wt 2)o, 2-2 ety 
zZ— —————__ t= — = —,—_—_ 
Q3:2 Oz (ée a 1) 
where the value of x must be obtained from the table of areas under the normal 
curve, if the z distribution is approximated by use of areas. 

Since the Gram-Charlier Type A series generally approximates a Pearson 
Type IV fairly well when aj is not too large, it is to be expected that the Type A 
series will approximate the z distribution in those cases when 7; = nz, and also 
when a3 is not too large. 


a= 
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18. Levels of significance and approximation methods. We shall apply the 
results of the previous paragraphs to the determination of the value of z for 


any level of significance a, i.e. the value of z such that [ y(z)dz = 1—-~«@ 


We have such levels as the median (the 50% point of significance), the 20%, 
5%, 1%, and .1% points as given in [9]. Where these tables apply there is no 
need for other methods. It would be desirable to extend the results for any 
level of significance whatever. The methods which we shall use are (1) the 
logarithmic frequency curve, (2) the Gram-Charlier Type A, and (3) the Type III 
approximation. For finding the levels of significance by the Incomplete Beta 
function, the reader is referred to [13], (p. lviii, topic (viii)). The logarithmic 
curve is very simple to use in conjunction with the table of areas under the 
normal curve. From Pae-Tsi Yuan we have 


ze—hc2 - 
(18.1) t= ( =3 a where (e° — 1)! 
é — 


takes the same sign as a3. The value of z is obtained from the table of the 
normal curve, 1.64 for the 5% level, 2.33 for the 1% level; the value of c is 
obtained from w (17.1), and consequently the value of ¢ (18.1). Then we have 


—2Z ; a 
“__“ to solve for za , where 2, 
Oz 


and oa, are the values of the mean and standard deviation of z as given by the 
proper formulas in (5), (6), (7). We illustrate with examples: 


a 
if 2. = value of z for any level of significance, t = 


(18.2) 5% point of z,m = ©~,m = 1. as = 1.5351, w = 1.2264, c = 1.64, 
t = 1.88, 2 = .6352, o. = 1.11, and asa result zs, = 2.72. Fisher [9] gives 2.7693. 


We can also find 237 easily form; = 1,m2= ©. Hereas = —1.5351, w = 1.2264, 
x = —1.64, ¢ = 1.197, 2 = —.6352, o. = 1.11, 2, = .694 compared with 
Fisher [9] 25% = .6729. 

(18.3) 1% point for m = 4, n. = 8, Z = —.0701, oz = .4819, as:, = —.3619, 


w = 1.0144, t = 2.17 and z:%, = .976, while the accurate result is .9734. 


From experience the values of z for any level of significance obtained by the loga- 
rithmic frequency curve will possess an error less than 2% of the true value of z 
for the level of significance if n; and ne are greater than twenty. It would 
seem that for other values of n; and nz the error could not be greater than 10%, 
and usually would be much less. 
19. The Gram-Charlier Type A. We take the series in the form 
—4ht2 
€ 
F(t) = ft) + Ase? O+ Ae), gO = S= 
V/2r 
Z2—-2Z Pe ac As:t 
_ Cz ' As = “a 4!° 





Some examples follow. 
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(19.1) We use the material of (18.3) and employ three terms of F(#). Zz = 
—.0701, o. = .4819, A3:. = —.0405, Au:2 = .0336, As; = .06032, A, = .02596. 
Fitting F(t) by ordinates we have ¢. = 2.17, and consequently z = .976. 

(19.2) We take mn = ne = 5,2 = 0, o, = .4952, As:2 = 0, Aa:e = .02798, As = 0, 
Ay = .01939. 

5% point: By ordinates t = 1.57, zsy%, = .777, while Fisher gives .8097. 

1% point: By ordinates t = 2.325, 21%, = 1.15, while Fisher gives 1.1974. 


(19.3) We take m = 3, ne = 20, 2 = —.15909, o. = .5099, Az. = —.10222, 
M:z = -08822, As = .12854, A, = .05438. By ordinates t = 1.523, 25, = .618, 
Fisher gives .5654. ¢ = 1.989, zi, = .855, Fisher gives .7985. The Gram- 
Charlier Type A is recommended only for mn: = ne and n,, nz = 20. 


20. Type III approximation, the median, and 5% point. Since for Type III 
the median, m,, is approximately two-thirds of the distance from the mode 
to the median if a; is moderate [12], [6], then we have further assuming 7 , 
No = 20. 


1 1 1/1 1 
(20.1) m, = +(2 — 1) + 5 (3 — *) . 
From experience this result will furnish an accuracy with an error less than 2% 
of the true value in the range above indicated. 
(20.2) ts, = 1.6437 + .2760a; — .04506a3 . 
This was found by use of Salvosa’s tables and for a3 > 1.1 by [14]. 
(20.3) 2s, = o,{1.644 + .276003:. — .0451a%:2] + 2. 
We illustrate the use of (20.3) with some examples. 
(20.4) m= m= 1, o. = 1.5706, o3:.=0, 2=0, 2% = 2.582, 
while the accurate value is z5% = 2.5421. 


(20.5) my = ©, nm = 1, ag = 1.5351, Z = .6352, o, = 1.11, 2% = 2.81. The 
accurate value is 2.7693. 


(20.6) mm = ne = 5,0, = .4952, a3: = 0, Z=0, 2% = .8141, while the 


= .8097. 


(20.7) m, = 4, nm. = 8, Z = —.0701, o, = .4819, ag = —.3619, 26% = .6712, 
while the accurate value is .6725. 


accurate value is 2s 


(20.8) mn, = 1, ne = 10,2 = —.5835, o, = 1.1353, as = — 1.4333, 25%, = .7283, 
while the accurate value is .8012. 


In a future paper exactly the same methods will be used for any per cent point 
of z whatever in order to compare with the results of W. G. Cochran [2]. If 
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nm, and nz are large we may use the approximate formulas for o., as3:;2, and 2 
to obtain to the order of o% , 


(20.9) zs, = 1.6440, + .7760 (2 - ), where ¢, = / 1 (~ a ~). 
NM 2\n2 Mm 


We expand Fisher’s result [9] 





1.6449 - 1 1 ee 1 
2% = wa i + .7843 os eee by the binomial theorem, where h = 5? to 
_— LD 1 z 
obtain a comparable result 
g - 1 1 
(20.10) 2% = 1.6450, + .7848{ — — — }. 
Mo Mm 


The numerical examples given in this chapter illustrate unfavorable cases as 
well as favorable ones. 


21. The distribution of F. Historically Snedecor [19] was the first to use F 
fore”. We find 


4 ino kny—1 
ni"' ns”? - 


: : 7 k(n ma 
2 (3 _ (ni F + ne)?"*"* 
2° 


(21.1) P(F) = 


The distribution of F is J shaped if n; S 2, and bell shaped for n; > 2, and for 
N2(n — 2) 
m(n2 + 2) 


exist for n; 2 4, are equidistant from the mode. ‘The moments are 


ny + 2m Ng — 2m 
’ (nye 2 )r( 3 ) 
Um = | — ————— - ——A 


-, Ne > 2m 
nN 


n, > 2 one mode exists, Fy = The two points of inflection, which 


nr No 
(z)e() 
i= a = ee, 
20/2(2n1 + ne») 
~ -9/ natal + M2) 
The exact results for u3, us, a3, and ay are omitted because of length. We 
have the theorem that as m1, m2 © in any manner whatever the distribution 


3: F 


of F approaches normality with mean F = 1, of = (2 + 1). The proof 
ni Ne 

is omitted. The only type of approximating curve of any value is Type III. 

Of course the distribution of F is Type VI. No tables exist for Type VI. 

Furthermore the F distribution approaches the Type III function so slowly as 

to make most approximations of little value unless a3:7 < 1.1. Other possible 
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ny(me + 1 mF : 
parameters are 6 = ae F, and H = eri [13]. Since | as:47 | = 


2| as:2| approximately we see that the distribution of H is more skewed than 


‘ ‘ i 2 © 2 n ) n 

that of z. We mention briefly also Sj; — S_ where S; = 7 Si, S, = Ww 83. 
iVi iV 

Clearly z, F, 6, and H give equivalent levels of significance. This is not true 


v2 2 
for z and S; — So. 
» 
: + 1 
Finally, since F = -, 
oo 


2? it may be interpreted as a quotient [5]. When the 


— i i ‘ i ° ° 2 
moments of F do not exist, it is due to the distribution function of s9 . 


22. Conclusion. We have found the seminvariants for the z distribution, and 
approximations for them. Type III, and the logarithmic normal frequency 
functions are shown to be excellent approximations to the z distribution. The 
approach to normality for the z distribution is proved. A formula is given for 
finding the 5% level of significance for z. The F distribution is studied along 
the same lines. As far as the construction of tables for levels of significance is 
concerned, the z distribution is much easier to use. My sincerest thanks are 
due Professor C. C. Craig for his helpful guidance and many suggestions. 
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THE DOOLITTLE TECHNIQUE 


By Paut 8S. Dwyer 


University of Michigan 


1. Introduction. Most authors who have presented the Doolittle method, 
from Doolittle [1] down to the present, have not given any formal proof that the 
solution is valid in the general case. They usually are content with a form 
describing the various steps of a Doolittle solution. 

The author has recently shown [2] that the Doolittle method can be abbrevi- 
ated to a technique which is also an abbreviation, essentially, of the method of 
single division and its abbreviation which Aitken called the ““Method of Pivotal 
Condensation” [3]. It appears at once that the validity of the Doolittle method 
follows from the validity of the method of single division—a validity which is 
readily established. 

However one may desire a “proof” which is based directly on the Doolittle 
technique without referring to other methods of solution. It is the chief 
purpose of this paper te present such a proof. It is accomplished by the intro- 
duction of a notation which precisely describes the conventional Doolittle 
process and by proving that this process results in a system of equations whose 
prediagonal terms are zero. It is a secondary purpose of the paper to emphasize 
the advantages of the Abbreviated Doolittle method and to explain and illus- 
trate minor variations in the conventional Doolittle technique. 


2. The Abbreviated Doolittle solution. We first direct our attention to the 
essential parts of a Doolittle solution and these are the last two rows of each 
matrix of the standard Doolittle presentation. The additional rows in the 
standard presentation are rows of products which are used solely for the purpose 
of finding the two bottom rows of each matrix and they need not be recorded, 
if a computing machine is available, since the essential information is present 
in the two bottom rows. Doolittle [1] did not have calculating machines (he 
used multiplication tables) but he put the important information in Table A 
and carefully segregated the supplementary information in Table B. With 
reference to this he wrote [1] 

“It is to be observed that the numbers in Table B have but a single use while 
those in Table A are used over and over, and where the number of equations is 
large, it is of great advantage that they should be thus tabulated by themselves 
in a form compact and easy of reference.”’ 

For purposes of proof, as well as for purposes of calculation if a computing 
machine is available, it is only necessary to utilize the forward part of the 
Abbreviated Doolittle solution which is the equivalent of the Doolittle Table A. 
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A four variable illustration of the Abbreviated Doolittle technique is presented 
in Table I. The successive equations are indicated by number, as is customary, 
and the operation which defines the equation is specified. The actual operation 
is indicated more explicitly by the notation of column 3 and this is discussed in 
the next section. 

The presentation of Table I introduces one variation from the standard Doo- 
little method. The division is made by the diagonal coefficient of each row 
rather than by its negative. One may still use the old technique, if he prefers, 
but it is felt that one can subtract products as easily as he can add products with 
modern machines equipped with automatic negative multiplication. In addi- 
tion the entries of the equivalent rows then have the same signs and, too, it is 
not necessary to take the time to change the signs of the second rows. This 
variation uses the same division method as the method of single division [2] 
and as the method of pivotal condensation [3] so that the abbreviated form of 
these methods is, essentially, the same as the abbreviated form of the Doolittle 
method. 

The application of this technique leads at each step to a coefficient for each 
variable. However if the process is to lead from our four equations in four 
unknowns, to three in three, to two in two, to one in one, it follows that all the 
entries to the left of the diagonal, which we may call prediagonal entries, must 
be zero. That this is true in the general case is the objective of the proofs of 
later sections. 


3. A notation for and description of the Doolittle technique. A main contri- 
bution of the present article is the use of a notation which describes the Doolittle 
technique. As long as the Doolittle process is described loosely by means of 
“operations” it is difficult to be precise in defining quantities which appear in 
the calculation, but when a notation is used which is definite enough to permit 
expansion in terms of the original coefficients, some sort of proof may be avail- 
able. The present notation bears some resemblance to that suggested by 
Gauss [4], though Gauss used letters to indicate the primary subscripts and 
numbers to indicate the number of secondary subscripts and his notation was 
directly applicable to the sums of least squares theory rather than to symmetric 
equations in general. 

We wish to find the solution of the equations 


. 


(1) >> ais 2: = Qn+1,j) q = i, a 


t=1 


where the matrix of the coefficients is symmetric. We do this by obtaining 
auxiliary equations which feature a decreasing number-of variables. No serious 
restriction is made if we assume that the variables 21 , 22 , x3 , etc., are eliminated 
successively. The Doolittle technique may then be described as follows: 

We take the first equation of (1) and divide by its leading coefficient, ay , to get 
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(2) ; > bi y= brats, where bi = ea 


> 
i=l an 


and we then form 
n 
(3) 2D Qi2.1Xi = An41,2-1 with aiz.. = diz — iby. 
re 
We then divide by ag., and get 
n 
4) Dy bieati = bntr2-1 with te. =. 


i=l 22.1 
We next form 


n 
(5) p Qi3.12%i = An+1,3-12 With diz.12 = dis — Gibsi — Gi2.1 b32.1, 
i=l 


and 


n 
° Qiz.12 
(6) r bis.12%% = bn41,3-12 with biz.12 = : 


ian] 33.12 


This process is continued so that, in general, we have 
n 


Zz Gij.12-.-7-1 Ti = An41,7.12---j7-15 1,2, °°: 


t=1 

n 
; bij-12.. 7-1 Li = Dn41,j-12---j-15 ] 1, 2, OEe ome 
t=1 


= Qadir — Qi2.1b j2.1 = i312 53-12 = 


= Qi, j-2.12...j-3D;,;-2.12.--j-3 ver Qi, j-1-12..-j-2D 7, j-1-12-- -j-2 


(10) 


ee a Qij.12..-j— 
Qjj-12.. -j- 

It is to be noted that the n equations (1) are transformed by this process to 
the n auxiliary equations of (7) or (8). The solutions of (1) are also solutions 
of these auxiliary equations since the auxiliary equations are linear combinations 
of (1). It is our purpose to show that the prediagonal coefficients of these 
auxiliary equations are always 0 so that these auxiliary equations feature a 
decreasing number of variables. 

We may use the term primary subscripts to indicate the first two subscripts 
and the term secondary subscripts to indicate the later subscripts which specify 
the order of elimination of the variables. The ‘“‘order’’ of the coefficient is then 
equal to the number of secondary subscripts. 
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The formula (9) gives the matrix of the final Doolittle set of equations. At 
each stage of the reduction one can write down a formula for all the elements 
in the matrix at that stage. Thus one can write the coefficients of order h, 
Q;j.12..-h, in terms of coefficients of order less than h, 

Aijei2---h = Aizi — Qadir ‘ais Qi2.1b j2.1 a 


(11) 


— Gi ,h—1012.. h—2b j,h—1012- - h—-2— Gihe12.. n—1D jhel2-+-h—1 + 
It follows at once that 


Qijei2..-h = Dijei2..-h—-1 — Gin.12..-h—1 Ojh-12.. -h—1 


(12) 


Qih-12---h—1 Ujh-12..-h—1 


= Gijeg...4—1 
Qhh-12++-h—1 


4. Some theorems on the interchangeability of subscripts. Our main objec- 
tive is to prove that the prediagonal terms are zero. In order to do this we first 
prove some theorems dealing with the primary and secondary subscripts. 

THEOREM 1: The value of ai;...., is not changed if the primary subscripts are 
interchanged. ‘This theorem which might be stated ‘“The matrix of the coeffi- 
cients of a given order is symmetric”’ follows from the symmetry of the matrix 
of coefficients of zero order. We can show that the symmetry of the matrix 
having coefficient of order h follows at once from the symmetry of the matrix 
having coefficients of order h — 1 by comparing the value a;;...., with that of 
a;...., Obtained by dual substitution in (12). Since the matrix of zero order 
coefficients is symmetric by hypothesis, it follows that the matrices of the 
coefficients of order 1, 2, 3, 4, etc., are in turn symmetric. 

THEOREM 2: Any pair of consecutive secondary subscripts may be interchanged 
without changing the value of the coefficient. This theorem indicates that, within 
prescribed limits, the order of elimination does not have any effect on the result. 

Consider the coefficient a;;....4... having r secondary subscripts before the 
k and s secondary subscripts after the / and consider the corresponding coeffi- 

:je-. lk... Which results from an interchange of k andl. These coefficients 
can be expressed by continued use of (12) in terms of coefficients of order r + 2. 
The resulting expansion of a;;......... is equivalent to that of a;;....%... with the 
interchange of the 1 and the k. It follows that the theorem is true if a;;..... = 
aij;....44. Now a double application of (12) to a;;....% leads to the expansion in 
terms of coefficients of order r (using the notation a;;, to indicate the coefficient 
of the r-th order) 


Dike Vike Ajke Dike 
a es A 
ike Ajke kke kke 
(13) Qije...kt = ize — - ' 


Akke ike 
au. — 
Akke 
Then a;;..... is expanded similarly, the difference is formed and found to be zero. 
It follows that the theorem is true. 
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The application of Theorem 2 with the continued interchange of successive 
secondary subscripts in all possible ways leads at once to 

THEOREM 3: The secondary subscripts may be interchanged in all possible ways 
without changing the value of the coefficient. This theorem might be stated “The 
value of the resulting coefficient is independent of the order of elimination.” 
This is the sort of result one would expect to find and indeed, some may feel that 
it is intuitively evident, but this formal proof is presented for those who desire 
a more rigorous appreach. 

Theorem 3 enables us to prove Theorem 4 which may be stated: The value of 
Qj ;.12.... 18 always zero if at least one of the secondary subscripts is equal to one of 
the primary subscripts. 

Suppose 7 is this subscript. Then by Theorem 3, 7 may be placed in the final 
position. Now by (12) we have 




















Aije... 
- Cig... = Cy... = 














0 


A similar statement holds if 7 appears among the secondary subscripts. 





5. The vanishing of the prediagonal entries. As an application of Theorem 4 
we can show that the prediagonal entries are identically zero and this is exactly 
what is needed to establish the validity of the forward Doolittle process. It is 

to be noted that the prediagonal entries are of form a;j.12...;-1 With z <j. Then 

2 must equal one of the secondary subscripts and the term is zero. 

It follows that no entries need be made to the left of the diagonal in the 
Abbreviated Doolittle solution and, indeed, no entries need be made in the 
original matrix below the main diagonal. A numerical problem is presented in 
the next section. 














6. Illustration. The Abbreviated Doolittle technique is illustrated in Table 
II. This illustration is essentially an illustration of a previous article [2] and 
serves as the basis, in a later section, for expansion into the standard Doolittle 
solution. The check is shown in the right hand column and the back solution 
is indicated. The check entries for the first matrix are obtained by adding the 
entries in the row to the main diagonal and then adding the entries in the 
column. All other check entries are obtained by adding the entries in the row. 

The solution is easily made once it is understood and results from continued 
application of formula (9). For example 










54.123 = M54 — Asibay — Ase.1b42.1 — O53 1243.12 


and this is 


54.123 = .8000 — (.2000)(.6000) — (.3200)(.1905) — (.4619)(—.1612) = .6935 





(see the underscored entries of Table II). Terms of this sort are easily com- 
puted if a calculating machine, and especially so if one equipped with automatic 
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positive and negative multiplication, is available. The back solution too is 
easily accomplished with a machine. It is only necessary to substitute in turn 


; ° - 54.12 

in each of the “b” equations. Thus the value of x; is “~~ = bs.123 , the value 
44-123 

of 22 iS bs3.12 — b43.12b54.123 = D53.124, that of x3 is bse.1 — dae.bs4.123 — bs2.1bs3 124 = 

bs2.138, etc. The back solution of the check is treated similarly. 


7. A variation in technique. Before proceeding with the presentation of a 
standard Doolittle solution it seems wise to indicate another possible variation 
in the technique in addition to the division by the diagonal coefficient rather 
than its negative. It is possible to obtain the Doolittle solution by using the 
fixed entry from the first of the equivalent rows in place of using the fixed ‘“‘b”’ 
entry and the variable “a”. This results from the fact that 


(14) Qiks... Dye... = yk... Dus... (- Oi) . 


Akke- ee 
Thus in Table IT the value as4.123 can be obtained with the use of 
54.123 = Ass — Aaibs1 — Age.1bs2.1 — Caz .12D53.12 


as readily as with the use of 


54.123 = Asa — Asibsr — Ase.1b42.1 — se-r2b43.12 « 


See the boxed entries of Table II. 

There seems to be no real choice between these techniques. The fixed ‘“‘b” 
is traditional in the standard Doolittle solution while the abbreviation of the 
method of single division leads to a fixed ‘‘a”. The point to be emphasized here 
is that either the fixed ‘‘a” or the fixed ‘‘b” can be used. Also (14) is used in 
the next section in supplying details for the check portion of a standard Doo- 
little method. 


8. The standard Doolittle method. If no computing machine is available 
or if a more detailed solution is desired, it is preferable to record the individual 
products of (9) and thus arrive at the standard Doolittle method. (The division 
by the diagonal coefficient rather than its negative is not a fundamental differ- 
ence.) The standard Doolittle method, from this point of view, is an expanded 
form of the Abbreviated Doolittle method with more details added. Its validity 
then follows from the validity of the Abbreviated Doolittle method. While it 
is not true that all prediagonal terms vanish in the standard Doolittle method, 
and this fact complicates the check by row sums, yet: the prediagonal a;;.... 
(and b;;....) are all zero. 

The standard Doolittle method is presented in Table III. Some remarks 
should be made about the non-recorded terms, the two check solutions, and the 
back solution. 

The blanks (—) indicate non zero entries which are usually not presented in a 
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Doolittle solution. They should be considered however if the first check method 
is to be used. 

The first check method, which is the logical extension of the check method of 
the Abbreviated Doolittle solution, has been outlined by Ezekial [5]. The row 
sum is the sum of all the entries in the row whether recorded or not. In order 
to check, it is necessary to add these unrecorded entries, and they are available 


TABLE II 
Abbreviated Doolittle Solution; illustration 








UM L2 X33 v4 









1.0000 | .4000 5000 | 6000 


. 2000 

















































| 2.7000 
a 1.0000 3000 4000 | — 4000 2.5000 
a | 1.0000 2000 |  .6000 2.6000 
_ wt — | 1.0000 -8000 3.0000 
1.0000 40000 .5000 '. 6000) -2000 2.7000 
| | ee eects 
1.0000 .40000 5000 6000 |. 2000 2.7000 
8400 1000 1600] | — -3200 1.4200 
| 1.0000 .1190 1905 | — |.3810 1.6905 
ntincnnecesaniataty Pcticikadincsnaie’ 
' Sar eee oe acer. a = a 
.7381 | |—.1190| .4619 1.0810 
fenntoneeed = 
1.0000 —.1612 |.6258, 1.4646 
5903 6935 1.2837 
| 1.0000 1.1748 2.1747 
| 1.0000 8152 1.8152 
1.0000 .0602 1.0602 
1.0000 — .9366 0635 


in the columns above if we make use of formula (12). Thus, if we wish to check 
5 


the value yi Qiibs = 1.6200, we-have 


i=1 









AyD + Aeiba + AziD4, + Ayyba + Asib41 = 






Qs, + Asbo: + A4ibs1 + Aiba + Agiba = 
.§000 + .2400 + .3000 + .3600 + .1200 = 1.6200. 





Another check method, which is recommended by Peters and Van Voorhis [6] 
sums the entries in the row only over those columns which are to be recorded. 
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This is presented as check method 2 of Table III. Asis to be expected, the check 
values of the a’s and b’s of the last two rows of each matrix are in agreement. 

It might be noted that one may use the first check method without checking 
the intermediate steps (the sums for each row) if he checks the sums for the last 
two rows of each matrix. 


TABLE III 


Doolittle solution, with checks 


Check Check 
| Method 1 | Method 2 


z2 Z3 

.4000 | .5000 5 | .2000 | 2.7000 | 2.7000 

.0000 | .3000 | .4000 | 2.5000 | 2.1000 

— 1.0000 | .2000 .6000 | 2.6000 | 1.8000 

Gis — — | 1.0000 .8000 | 3.0000 | 1.8000 
ait .0000, .4000 | .5000 | .6000 .2000 | 2.7000 | 2.7000 
bis .0000' .4000 | .5000 | .6000 .2000 2.7000 | 2.7000 





dis — .0000 | .3000 .4000 .4000 | 2.5000 . 1000 
ainda — .1600 | .2000 . 2400 .0800 .0800 . 6800 
Qi21 .8400 | .1000 . 1600 . 3200 .4200 .4200 
Diot .0000 | .1190 | .1905 .3810 .6905 .6905 








iz / — 1.0000 | .2000 6000 | 2.6000 | 1.8000 
aids: — | .2500 .3000 1000 3500 6500 
Geatiies — | 0119 .0190 0381 1690 .0690 
Son | .7381 |—.1190 4619 0810 0810 
bien 0000 |— .1612 6258 4646 4646 





Qis 1.0000 . 8000 3.0000 . 8000 
Aiba) . 3600 . 1200 .6200 .4800 
Qi2- 10491 .0305 .0610 . 2705 .0914 
iz-12b43° 12 0192 |—.0745 .1743 .0553 
Gis: 123 | . 5903 .6935 . 2838 . 2839 





ines | | 1.0000 | 1.1748 | 2.1748 

isin 1.0000 [—1894 | .8152 1.81532 [—.3506 

Dis-1a4 .0000 | .0970 | .2288 | .0602 | 1.0602 | .4143 | .2160] 

bir .0000| -0241 | .4076 | .7049 |-.9366 | .0634 ] 1.3049 | .9076| .4241 


The back solution is carried out as in Table II. If no computing machine is 
available or if the detailed steps are desired they may be indicated as in Table 
Ill. The entries in the box under the 2, column are respectively b54.123b43.12 , 
bs4.123D42.1, ANd bs4.123b4.. Those in the preceding column are bs3.194b32... and 
bs3.124031. The other entry is bs2.134b2:.. The values of the coefficients are ob- 
tained by subtracting these row entries from the constant term of the corre- 
sponding “b’” equation. Thus, 653.124 = (.6258) — (—.1894); bseasg = 
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(.3810) — .0970 — .2238, etc. The back solution of check method 1 agrees 
with that of check method 2. A form for accomplishing the back solution of 
the check is indicated at the right. It is not necessary to complete the back 
solution of the check if it is not desired, and indeed, there are some who feel 
that the use of the row sum check is unnecessary with modern computing ma- 
chines [7]. The basic check is substitution in the original equations. 


9. Summary. The chief purpose of this paper is to show that the Doolittle 
technique actually leads to a set of equations featuring a decreasing number of 
unknowns. This is accomplished by the introduction of an appropriate notation 
to describe the process and the establishment of certain theorems which serve 
to validate the process. These theorems are of some interest aside from the 
application made here. It is a secondary purpose of this paper to emphasize 
the practicability and theoretical advantages (relative ease of calculating, theo- 
retically more accurate, less chance for numerical error, less recording, less time 
consuming, more compact, and more easily checked) of the Abbreviated Doo- 
little method and to explain and illustrate possible variations in technique in the 
forward and check (by row sums) portions of the standard Doolittle solution. 
It should be noted that the notation suggested is very useful in providing an 
easy development of various theorems used in multiple and partial correlation 
studies, the presentation of which is not the purpose of the present paper. 
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NOTES 


This section is devoted to brief research and expository articles, notes on methodology 
and other short items. 
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A PROBLEM IN ESTIMATION 


By JoserH F. Daty 


The Catholic University of America 


Several recent psychological studies in the field of memory testing [1], [2], [3] 
have suggested the following problem. Let each individual FE in our popula- 
tion be characterized by the variates y’, --- , y’;y’", ---, y”"' (p > 0). Sup- 
pose, however, that circumstances make it impossible for us to observe the last 
t variates. For example, we may think of y’, --- , y’ as an individual’s scores 
on a battery of tests, and think of y’™’, --- , y”*‘ as measures of certain psycho- 
logical characteristics which, though affecting the individual’s performance, are 
not subject to direct observation. To make up for this, assume that we have 
a theory which tells us that if y’™’,--- , y’*' are held constant, then the ob- 
servable y’s are dependent upon them according to a specified regression equation 


y’ = ay", Q=1,---,pu=ptl,-:-,pt+e). 
Somewhat more precisely, we assume the distribution laws 
(Q) fly's +++, y?") = ny *"? | A, |} exp {[-3An(y" — ay’ — a’)}, 


(where r, s = 1, --- , p + t, and repeated indices are to be summed according 
to the usual convention) and 


| er p+ 2\—}p 1 i i 2 
(2) fy sess y" ly?) 9?) = One®) exp {— 52 Y - ayy}. 


The zx; are supposed to be known, but except for the conditions imposed by (1) 
and (2) nothing is known about the quantities A,,, a’, ando’. Having observed 
the test scores y, (a = 1,---, N) obtained by N individuals E, drawn at 
random from the population, we wish to estimate the values y2™,--- , y2*' 
corresponding to each E, , and the essential parameters in the distribution law 
(1), particularly the variances and covariances of y’™’, --+ , y?**. 

We can easily find optimum estimates of the y2 by applying the method of 
maximum likelihood to the function (2) after substituting for the y‘ the scores 
y, obtained by the individual in question. Thus if we write 
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Ow = ie, ) | v a || Vuv rs 


(assuming thereby that the rank of the matrix || 2} | 


mI 


is t) we have 


(3) Wee = of riy’, ‘ 


These estimates are unbiased in the sense that the expected value of 7 calculated 
from the distribution law (2) is y”. 

But when we come to estimate the variances and covariances involved in (1), 
the procedure is less straightforward. Under the present circumstances we 
cannot use the expression 


| v —Vv 
(4) yoy & Ye - Ye - 9), 


for the sample covariance of y“ and y’. We might, of course, try substituting 
the estimates 7, from (3) for the unknown y* in (4). But this expedient will 
in general produce a biased estimate. Denoting the required covariance by 
A“ (the element in the appropriate position in the inverse of the matrix || A,. ||), 
we find as a matter of fact that the expected value of (4) when the y% are re- 
placed by their estimates 7, is 
(5) 
This bias may or may not be important in any given case. But it can conceiv- 
ably be quite serious if the A” are relatively small, especially if such expressions 
are employed in the usual way to estimate the correlation coefficient rather than 
the covariance. 

Perhaps the most logical way to attack the problem is through the joint 


A” + ov”. 






, i - 1 > ‘ cs . ° 

distribution of y,---, y” alone, obtainable by integrating the undesirable 
: +1 +t , iz 

variates y”’,---, y” out of (1). We therefore consider 


(6) = f(y’, +++, y”) = (24)? | Ai; | exp {-44i,(y' — a')(y’ — @’)}, 
where 
Ai; = Ai; — AwBYA,;, — || BY || = || Aw II. 
Moreover, when account is taken of (2), we find that we must have 
Ai; —_ Oi 
o~ o~ 
(55; being Kronecker’s delta). If we now form the likelihood function 
Tsu ,°+*, ye) from (6) for our sample, and set its derivatives with respect 


2 ° ° nm ° 
to the a”, o, and the B’”’, equal to zero, we arrive, after some simplification, at 
the equations 


a =a = yi: [ef. (3)] 
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14° -1¥ Wi - aera - sta} 5, = 0, 


aj 1 a a 1] , a j 
14 _ W > (yi. — zia")(yi, — ria } rex; = 0, 


AY =@ 6" + 27, A" 2}, 


for determining the maximum likelihood estimates. The first of equations (7) 
is already solved for the a”, and the solution of the simultaneous equations for 
the remaining essential parameters yields the estimates 


1 é i au \2 
(8) — N@-b dX (Yo — Va) 


() i” = DG — ANG — @) — ve. 


A considerable amount of algebraic manipulation is required to put the solu- 
tions in the form given above; but since the results are about what one would 
expect in view of (5), we omit the details. As is often the case, some bias re- 
mains in the “optimum” estimates (9). However, this can be eliminated by 
writing N — 1 in place of N. The estimate (8) of o’ is unbiased as it stands. 
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CONFIDENCE LIMITS FOR AN UNKNOWN DISTRIBUTION FUNCTION 
By A. KoLMOGOROFF 
Moscow, U.S.S.R. 


Let 21, 2, +--+ , Zn be mutually independent random variables following the 
same distribution law 


(1) Pir; < &} = F(). 
A recent paper by A. Wald and J. Wolfowitz’ deals with the problem of using 


1A. Wald and J. Wolfowitz, ‘Confidence limits for distribution functions,’’ Annals of 
Math. Stat., Vol. 10 (1939), pp. 105-118. 
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the observable values of the x’s to estimate the function F(¢). In this connee- 
tion it may be useful to recall the following results published by me in 1933, 
Put 


(2) PA = & 


where N(é) denotes the number of those z’s whose observed values do not 
exceed &. 

THEOREM 1: If the function F(£) is continuous then the distribution law of the 
quantities 


(3) D, = sup | F(t) — Fyl(é) |-V/n 
does not depend on F(é). 

Denote by ®,(A) the value of the probability P{D, < } which is common 
to all continuous distribution functions F(é). 

THEOREM 2: For n tending to infinity, the distribution function ®,() tends to 


(4) e0) = Yo (-vte™™ 


uniformly with respect io X. 
A more elementary proof of Theorem 2 was given by N. Smirnoff in 1939.' 
Another paper by the same author’ gives a table of the function (a). 
Without the assumption that F(£) is continuous, we easily obtain 
THEOREM 3: Whatever be the distribution function F(&), 


(5) P{D, <A} > ®,(A). 


Theorems | and 3 giving the exact lower bound of the probability that F,,(£) 
will satisfy the inequality 


r 
(6) | F(é) es F,,(&) | < a. 
| | a/n 
for all values of £ can be used to establish confidence limits for F(£) corre- 
sponding to the confidence coefficient 


(7) a = ®,(A). 


These confidence limits will be free from any restriction concerning the nature 
of the function F(é). 


2 A. Kolmogoroff, ‘Sulla determinatione empirica di una legge di distributione,’’ Giornale 
dell’ Istituto Italiano degli Attuari, Vol. 4 (1933), pp. 83-91. 

3N. Smirnoff, ‘“‘Sur les écarts de la courbe de distribution empirique,’’ Recueil Math. de 
Moscou, Vol. 6 (1939), pp. 3-26. 

4N. Smirnoff, ‘‘On the estimation of the discrepancy between empirical curves of distri- 
bution for two independent samples,’ Bulletin de l’ Université de Moscou, Série internationale 
(Mathématiques), Vol. 2, fase. 2 (1939). 
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For sufficiently large values of n we can use the limiting distribution (4) and 
write 


(8) a = P(A). 


The following short table, based on that of Smirnoff,’ gives the. values of \ 
corresponding to a few chosen confidence coefficients a. 


TABLE OF 






a 


95 


.98 | 1.52 
.99 | 1.63 
.995 1.73 
.998 1.86 
.999 1.95 





Smirnoff’s paper’ contains still another application of the function P(A). 
Denote by ry : 9 eee, Ln, and 1, re, Fees Ens two sequences of mutually inde- 
pendent random variables following the same probability law f(g). Let further 
F,,(€) and F,,,(€) be two random step functions corresponding to these series, 
defined as in (2). Smirnoff proves then the following 

THreoreM 4: If the probability law F(&) is continuous, then the probability 


(9) P {sup | Fn, (€) — Fa (é) | <A fmm = Pn, np (A) 


NN 


is independent of the function F(é). If n and nz are indefinitely increased subject 
to the restriction that the ratio ni/n2 remains between two fixed numbers a, and az 





(10) 0<a<"<a<+ 


then 
(11) Pp, .ny(A) — P(A). 


In the general case, where the probability law F(&) is absolutely arbitrary we have 


(12) P{sup | Fn, (€) — Fr(€) | SA o/s SZ Paz nq(A). 
Ne 
Owing to the above results the quantity 
at _ M1 Ne 
(13) Days = SUP | Foy(@) — Plt) | 4/ 


could be used as a criterion to test the hypothesis that the probability laws of 
the two series of observable variables are actually the same. 
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CORRECTIONS TO A PAPER ON THE UNIQUENESS PROBLEM 
OF MOMENTS 


By M. G. KENDALL 


London, England 


I wish to make certain corrections in my paper on “Conditions for Unique- 
ness in the Problem of Moments” (Annals of Math. Stat., Vol. 11 (1940), p. 402). 
I thought I had succeeded in improving on results given earlier by Stieltjes, 
Lévy and Carleman, but this is not so. 

Theorem 1 of the paper stated that a set of moments determines a distri- 


°c r 
. . ; vy . 
bution uniquely if >> — converges for some real non-zero t, v, being the absolute 
r=0 , 3 


moment of order r. This is true, and a similar result has been proved by Lévy, 
but my proof contained a small lacuna. It was shown that the characteristic 
function ¢(¢) has a Taylor expansion which, under the conditions of the theorem, 
is convergent; but it has also to be shown that it is equal to the sum of that 
expansion. This may be seen as follows: 

We have 


| tx |" 


| 
| 


and hence, on taking mean values, 


n—1 *4\r n 

ut i bet 
ie ~ Fe gee 
| r=0 rs | nN: 


n 


: 3: Dut ; 
Since by hypothesis ag ae 0, o(t) must be equal to the sum of its (convergent) 
n! 


Taylor expansion. 

The principal error was a statement that y\/"/n must either tend to a limit or 
diverge. For this reason, the second theorem should run: a distribution deter- 
mines a distribution uniquely if lim y/"/n is finite (not lim yi!" /n as originally 
stated). Theorem 3 should also be restated with the upper limit substituted 
for the limit therein. 


Theorem 4 stated that a set of moments uniquely determines a distribution 
if = ifs diverges. A rigorous proof is as follows: 
The characteristic function obeys the relation 
lo"(t)| Sm, n>] 


provided, of course, that v, exists. A theorem of Denjoy’ states that if a func- 
tion f(x), defined in the segment (a, b), possesses derivatives of all orders therein, 


1Arnaud Denjoy, “‘Sur les fonctions quasi-analytiques de variable réelle,’’ Comptes Ren- 
dus Vol. 173 (1921), p. 1399. 
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if M, is the maximum of | f‘” (x) | in the segment and if > is divergent, 


1 
Mi 
then f(x) is completely determined by its value and that of its derivatives at a 
single point. (t) obeys the conditions of the theorem and by taking the point 
to be t = 0, theorem 4 follows. 

I hope that this note will correct any misunderstandings that may have arisen 
on the main paper, and I regret that a number of circumstances, not the least 
of which is war, have made it impossible to forward the correction at an 
earlier date. 


ANNOUNCEMENT CONCERNING COMPUTATION OF 
MATHEMATICAL TABLES 


In the December, 1939, issue of the Annals of Mathematical Statistics, p. 399, 
there appeared an Announcement of the Mathematical Tables Project. This 
project is operated by the Work Projects Administration of New York City, 
as O. P. No. 265-2-97-11 under the technical supervision of Dr. A. N. Lowan. 
It is sponsored by the National Bureau of Standards, Dr. Lyman J. Briggs, 
Director. 

In order to keep the readers of the Annals up-to-date on the progress of the 
work of the Project, information will be released from time to time. 

The following list shows the status of work, as of October, 1941. The reader 
is referred to the December, 1939 issue of the Annals with respect to which n 
will denote the n‘* item of Tables Published, Pn will denote the n** item of 
Tables in Progress and Cn will denote the n* item of Tables under Consideration. 


Tables published. 1, 2, 3, Pl, P2, P3, P4, P6(b), P6(c), P6(d), P6(e), P7, 
C7 and also 

1. Table of Five-Point Lagrangian Interpolants for arguments ranging be- 
tween 0 and 2 at intervals of 0.001. 

2. Tables of Grid Coordinates (American Polyconic Projection) at 5 minute 
intervals of latitude and longitude for latitude from 70°N to 28°N and for lati- 
tude from 49°N to 72°N. 

3. Table for Map Projections of Northwestern Extension of U. 8. 


Tables in process of reproduction. P5, P6(a), P8 and C1 for [0 (.001) 7 (.01) 
50 (.1) 300 (1) 2,000 (10) 10,000; 12D] also 

1. Tables of Section Moduli and Moments of Inertia for Structural Members 
used in Naval Architecture. (For the Bureau of Marine Inspection and 
Navigation. ) 

2. Tables of Si(x) and Ci(x) for x ranging from 10 to 100 at intervals of 0.001. 
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3. The zeros of the Legendre Polynomials up to the 16th order to 15 decimal] 
places and the Weight Coefficients for Gauss’ Mechanical Quadrature Formula, 


Tables for which manuscripts are completed. P9, P11, C6, (the function 2’, 
instead of A(z, y), has been tabulated to 15 places), and also 


1. Table of | Jo(t) dt from 0 to 10 at intervals of 0.01 to 10 places. 
0 


Tables for which computations are completed. P10 (also tanh z, coth 2), 
C2, C3, (change to n = —21, —20--- 0) and also 

1. Various hydraulic tables based on Kutter’s and Manning’s formulae, 
(Tabulation suggested by the War Department.) 

2. Table of reciprocals of the integers from 100,000 to 200,000. 

3. Table of the Associated Legendre Functions P7 (x) and Q7'(x) for n ranging 
between 1 and 10, and m between 0 and 4; for arguments x and 7x where z 
ranges between 0 and 10 at intervals of 0.1. Also corresponding values for half- 
integral values of n and values of the functions for arguments in degrees. (Tabu- 
lation suggested by National Defense Research Committee.) 

4. Tables of R sin 6 and R cos 6. R = 1000 (10) 10,000, 6 = 5(5)800 (in 
mils). 


Tables for which computations are in progress. C3 (for n = 1, 2, --- 20) 
and also 

1. Table of the Bessel Functions Yo(z) and Y;(z) for the same complex argu- 
ments as in Jo(z) and J;(z), mentioned in P9. 

2. Tables of Length of Meridional Are at one-minute intervals. 

3. Tables of the Confluent Hypergeometric Function for selected values of 
the parameters. 

4. Tables of three-point, four-point, six-point and seven-point Lagrangian 
Interpolants. 

5. Table of Tchebysheff Polynomials. 


Tables under consideration. C4 and also 

1. Table of the first 10 powers of the reciprocals of the integers from 1 to 1,000. 

2. Extensive tables of Elliptic Functions for both real and imaginary 
arguments. , 

3. A 12-place table of Inverse Circular and Hyperbolic Functions other than 
Are tan 2. 


4. Table of the Integral I Yo(t) dt. 
0 


5. Tables of the non-periodic solutions of the Mathieu Differential Equation. 

6. Table of the Error Functions for complex arguments (suggested by Federal 
Communication Commission). 

7. Tables of the Unit-Sigma Functions and their integrals. 
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8. Tables of Circular Functions for Complex Arguments. 

9. Tables of the Zeros of the Hermite and Laguerre Polynomials and of the 
corresponding Weight Factors in Gauss’ Mechanical Quadrature Formula. 

10. Table of Lamé Polynomials. 

11. Table of Military Grid Coordinates for certain ‘‘Control Stations.”’ (For 
the War Department.) 

12. Tables of the Chi-Square Distribution and “Student’s” ¢-distribution. 

13. Tabulation of Fisher’s A-, B-, and C- Distributions of the Multiple Correla- 
tion Coefficients. 

The Project would welcome suggestions for the computation of new tables of 
interest in pure and applied mathematics, as well as information regarding com- 
putational work in progress elsewhere. 

Communications should be addressed to Major Irving V. Huie, Administrator, 
Work Projects Administration, 70 Columbus Avenue, New York City. 

Requests for copies of published tables should be addressed to Dr. Lyman J. 
Briggs, Director of the National Bureau of Standards, Washington, D. C. 





REPORT OF THE CHICAGO MEETING OF THE INSTITUTE 


The Fourth Summer Meeting of the Institute of Mathematical Statistics was 
held at The University of Chicago, Tuesday to Thursday, September 2 to 4, 
1941, in conjunction with the meetings of the American Mathematical Society, 
the Mathematical Association of America, and the Econometric Society. The 
following sixty-eight members of the Institute attended the meeting: 


R. L. Anderson, T. W. Anderson, K. J. Arnold, H. M. Bacon, Walter Bartky, W. D. 
Baten, A. A. Bennett, Paul Boschan, I. W. Burr, J. H. Bushey, W. E. Cederberg, W. G. 
Cochran, A. T. Craig, C. C. Craig, J. H. Curtiss, J. F. Daly, W. E. Deming, J. L. Doob, 
P. L. Dressel, P.S. Dwyer, Churchill Eisenhart, M. L. Elveback, H. P. Evans, C. H. Fischer, 
W. C. Flaherty, R. M. Foster, C. H. Graves, Louis Guttman, W. L. Hart, F. C. Hinds, 
A.§S. Householder, E. V. Huntington, William Hurwitz, M. H. Ingraham, Dunham Jackson, 
Leo Katz, J. F. Kenney, L. A. Knowler, L. F. Knudsen, Tjalling Koopmans, C. F. Kossack, 
O. E. Lancaster, D. H. Leavens, B. A. Lengyel, W. G. Madow, J. N. Michie, A. M. Mood, 
J. E. Morton, Leah Naugle, Harold Nisselson, J. I. Northam, E. G. Olds, Oystein Ore, 
C. K. Payne, G. A. D. Preinreich, Francis Regan, Selby Robinson, C. F. Roos, M. M. 
Sandomire, Max Sasuly, Henry Scheffe, H. M. Schwartz, Harry Siller, J. H. Smith, M. E. 
Wescott, S. S. Wilks, E. W. Wilson, Gale Young. 


The opening session, on Tuesday morning, was devoted to contributed papers 
on Probability and Statistics and was held jointly with the American Mathe- 
matical Society and the Econometric Society. The Chairman was Professor 
A. T. Craig, University of Iowa, and the following papers were presented: 


1. A geometric derivation of Fisher’s z-transformation. 
J. B. Coleman, University of South Carolina. 
2. Large sample distribution of the likelihood ratio. 
Abraham Wald, Columbia University. 
3. On the integral equation of renewal theory. 
(Read by title.) 
Willy Feller, Brown University. 
. Cumulative frequency functions. 
Irving Burr, Purdue University. 
. On spherical probability distributions. 
K. J. Arnold, Massachusetts Institute of Technology. 
. Some observations on analysis of variance theory. 
(Read by title.) : 
Hilda Geiringer, Bryn Mawr College. 
7. On the asymptotic distribution of medians of samples from a multivariate population. 
A. M. Mood, University of Texas. 
8. A problem of estimation. 
J.F. Daly, Catholic University. 
Abstracts of these papers follow this report. 


On Tuesday afternoon a session was held jointly with the Econometric Society 
on Time Series Analysis. Under the chairmanship of Professor C. C. Craig of 
the University of Michigan, the following papers were presented: 
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1. Is sampling theory applicable to economic time series? 


Tjalling Koopmans, Penn Mutual Life Insurance Co., Philadelphia. 
2. Serial correlation. 


R. L. Anderson, North Carolina State College. 


The morning session on Wednesday was held jointly with the Econometric 
Society on Curve Fitting. The chair was held by Dr. J. Marschak of the New 
School for Social Research and the following papers were presented: 

1. Weights to compensate for transformation in curve fitting. 


T. O. Yntema, University of Chicago and Cowles Commission. 
2. Curve fitting by cumulative addition. 


John H. Smith, University of Chicago and Cowles Commission. 


On Wednesday afternoon, Professor 8. 8S. Wilks of Princeton University acted 
as chairman of a session on Multivariate Analysis. The following papers were 
read: 


. On testing sets of means and discriminant analysis. 
Abraham Wald, Columbia University. 
2. On tests of hypotheses concerning variances and covariances. 
William G. Madow, Bureau of the Census. 


The Josiah Willard Gibbs Lecture of the American Mathematical Society was 
delivered on Wednesday evening by Professor Sewall Wright of the University 
of Chicago. His topic was Statistical Genetics and Evolution. 


On Thursday morning a joint session on Demand and Supply Analysis was 
held with the Econometric Society. At this session Dr. C. F. Roos of the In- 


stitute of Applied Econometrics presided, and the following papers were 
presented : 


. Demand analysis for certain commodities based on income and budget data. 
J. Marschak, New School for Social Research, and George Garvey, National Bureau 
of Economic Research. 
. Derivation of elasticities of demand and supply: A direct method. 
Oscar Lange, University of Chicago and Cowles Commission. 
. On the workings of a general equilibrium system. 
J. L. Mosak, University of Chicago and Cowles Commission. 


informal reception was held on Monday evening in the Judson Court 
Lounge. On Tuesday and Wednesday afternoons the ladies of the Mathematics 
Department of the University of Chicago served tea in the Eckhart Hall Common 
Room. After the joint session on Tuesday afternoon, the Cowles Commission 
for Research in Economics gave a tea in the Common Room of the Science 
Building. On Thursday evening a joint dinner of the four mathematical organi- 
zations was held in Hutchinson Commons, preceded by an informal reception 
at the Reynolds Club. 

EpwIin G. OLDs, 
Secretary 





ABSTRACTS OF PAPERS 
(Presented on September 2, 1941, at the Chicago Meeting of the Institute) 


A Geometric Derivation of Fisher’s z-transformation. J. B. CoLEMAN, Uni- 
versity of South Carolina. 


In fitting points in a plane by a line so that the sum of the squares of the perpendicular 
deviations shall be a minimum, a second line is found for which the sum of the squares of 
the deviations is a maximum. Let 2d? be the sum of the squares of the deviations of the 
points from the minimum line, and 2D? be the sum of the squares from the maximum line. 
Then >D?2/Ed? = (l+r)/(1—,r). 4 log(1+7r)/(1 —1) is Fisher’s z-transformation for test- 
ing the coefficient of correlation. 


Large Sample Distribution of the Likelihood Ratio. ABRAHAM WALD, Columbia 
University. 


The large sample distribution of the likelihood ratio has been derived by S. S. Wilks 
(Annals of Math. Stat., Vol. 9 (1938)) in case of a linear composite hypothesis and under 
the assumption that the hypothesis to be tested is true. Here a general composite hy- 
pothesis is considered and the distribution in question is derived also in case that the 
hypothesis to be tested is not true. Let f(z: , +--+ , 2p , 01, °*+ , 0%) be the joint probability 
density function of the variates 2; , --- , 2» involving k unknown parameters 6; , «++ , &. 
Denote by H., the hypothesis that the true parameter point 6 = (01, --- , 0.) satisfies the 
equations £,(0) = --- = &(@) = 0, (r < k). Denote by d, the likelihood ratio statistic for 
testing H., on the basis of n independent observations on 2; , --- , Zp». For any parameter 
point @ let &;(0) = aie and let ci;(@) be the expected value of PNT te 

2 v 
Stefan te calculated under the assumption that @ is the true parameter point. 
2 
For any @ denote by A(@) the matrix || &;(@) || (@ =1,---,r;j =1,--- ,k) and let 
|| oi;(0) || = || ccj(@) |, @, 7 = 1, +--+, k). Let furthermore || o%,(@) ||, (u,v =1, +--+,” 
be the matrix equal to the product A(6)-|| 0:;(@) ||-A(@), where A(@) is the transpose of 
A(@). Finally let || é%.(@) || = || o%.(@) ||", (u,v = 1,-+-,7r). For each n and @ denote 
by yin(O), «++ 5 Yrn(@) a set of r variates which have a joint normal distribution with mean 
values +/néi(0), --- , ~/né-(@) and covariance matrix || o%,(0) ||, (u,v = 1,---, r). De 


note the quadratic form 7. > Yun (9) Yon(O)E*, (0) by Q,(@). It has been shown that under 
v=lu=1 ° 


certain assumptions on f(z: ,--+ , 2p, 9), &:(0), «+ , &(@) we have lim {P(—2 log m& < 


t | 0) — P[Q,(@) < t | 6]} = 0 uniformly in ¢ and @, where for any z P(z < t | 6) denotes the 
probability that z < ¢ holds under the assumption that @ is the true parameter point. The 
distribution of Q,(@) is known and has been treated in the literature. If H. is true, then 
£:(0) = --- = &(0) = 0, and Q,(6) has the x? distribution with r degrees of freedom. 


On the Integral Equation of Renewal Theory. W. Freier, Brown University. 


t 
As is well-known, the equation U(t) = G(t) + | U(t — x) dF (zx) has frequently been 
9 


470 





ABSTRACTS OF PAPERS 471 


discussed, under different forms, in connection with the population theory, the theory of 
industrial replacement, etc. In the present paper it is shown that, using Tauberian 
theorems for Laplace integrals, it becomes possible to analyze in detail the asymptotic 
behavior of U(t) ast — « and also to solve some other problems which have been discussed 
in the literature. Strict conditions for the validity of different methods to treat the equa- 
tion are given together with some modifications found to be necessary. The paper will 
appear in the Annals of Mathematical Statistics. 


Cumulative Frequency Functions. I. W. Burr, Purdue University. 


Frequency and probability functions play a fundamental role in statistical theory and 
practice. They are, however, often inconvenient and difficult to use, since it is necessary 
to integrate or sum to find the probability for a given range. Theoretically the cumulative 
or integral frequency function would seem to be better adapted to determining such prob- 
abilities, since the latter can be found simply by a subtraction. The aim of this paper is 
to make a contribution toward the direct use of cumulative frequency functions. Some 
general properties and theory of cumulative functions are presented with particular empha- 
sis upon certain moment functions adapted to such direct use. Both continuous and dis- 
crete cases are included. A list of possible cumulative functions is given and a particular 
one, F(x) = 1 — (1 + 2*)~*"!, discussed fully. This function has properties which make it 
practicable and adaptable to a wide variety of distribution types. It well illustrates the 
possibilities of the cumulative approach. 


On Spherical Probability Distributions. KENNETH J. ARNOLD, Massachusetts 
Institute of Technology. 


Two methods of correspondence for circular distributions to the normal error function 
have led to non-constant absolutely continuous functions [See F. Zernike’s article in Hand- 
buch der Physik Vol. 3, pp. 477-478]. The corresponding distributions for the sphere are 
found. The case of diametrical symmetry for both circle and sphere is discussed. Tables 
of the probability integrals involved are given and an application in geology is included. 


Some Observations on Analysis of Variance Theory. Hi~tpa GEIRINGER, 
Bryn Mawr College. 


The test functions used in analysis of variance present themselves in different classes 
of important problems. Their distribution has been determined and tabulated by R. A. 
Fisher! under the hypothesis that the chance variables are all independent of each other and 
subject to the same normal law. Consequently we can in this way test only the hypothesis 
that the theoretical populations have all these properties. 

If it is not possible to determine the exact distribution of test functions under sufficiently 
general assumptions regarding the populations we may: (a) find an asymptotic solution of 
the problem, i.e. determine the distribution of the test functions for large samples.2 Or (b) 
determine at least the mathematical expectations and the variances of the test functions 
for appropriately general populations and for small samples. 

It is well known that the expectations of the two quadratic forms which are basic in the 
analysis of variance are equal, even if the n populations are not normal but equal to each 
other (Bernoulli series). But, in addition, we can prove the mathematical theorem that, 
under the same conditions the expectation of their quotient equals one. The next step con- 
sists in studying the case that the n distributions are not equal to each other and to investi- 
gate certain inequalities characteristic for the Lexis Series and Poisson Series. These 
different criteria are completed by the computation of the variances of the test functions. 


1 “Metron,” Vol. 5 (1926), p. 90-104. 
2 See e.g. W. G. Madow, Annals of Math. Stat., Vol. 11 (1940), p. 193. 
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In addition to the above mentioned test functions known as “variance within” and 
‘“‘variance among”’ classes other symmetrical test functions have been considered in the 
classical analysis of variance. Here again we may assume quite general populations. |t 
results that the Lexis as well as the Poisson Series may now be characterized by equalities 
(instead of inequalities). 

Finally it seems to be worthwhile to omit the assumption of independent chance variables 
and to study different kinds of mutual dependence. These investigations lead to new ip- 
structive inequalities among the expectations. These last considerations seem to be con- 
nected with Fisher’s “‘intraclass correlation’’ and to supplement this idea. 


On the Asymptotic Distribution of Medians of Samples from a Multivariate 
Population. A. M. Moon, University of Texas. 


Let two variates xz; and x2 have a density function f(z: , 2) which, besides being positive 
or zero and having its integral over the whole space equal to one, shall satisfy these con- 


ditions: 
ao] / 1 0 1 
[ z («. ’ *) dz; [ f(x ; 0) dx, oe O 6) 
0 n Eas n 
20 1 2 1 
[ (5, ns) dz, [ S(O, X2) dx +O (*) 
— OO n =.) n 


The coordinate system is assumed to have been chosen so that the population median is at 
the origin. Let (%: , Z2) be the median of a sample of 2n + 1 elements drawn from a popula- 
tion with this density function. It is shown that for large samples (%: , %2) is normally 
distributed to within terms of order 1/+/n with zero means and variances and covariances 
given by certain integrals of f(x , 22). 

A similar result is true for k as well as two variates. 





A Problem in Estimation, Josep F. Day, The Catholic University of America. 


Consider a normal population in which each individual is characterized by the variates 


Y1,°** 5 Yp» Yps1 5 Yor? Suppose that the latter two are not directly observable, but that 
for given values of yps1 , Yp+2 the first set of y’s is independently distributed about the 
“regression line”? yz = Yps1 + kypi2 (kK = 1, --+ , p) with a common variance o?. For each 
individual, one can thus determine values §).1 , %p.2 from the observed y: , --- , Yp , using 
the method of least squares. Assuming a similar relation between the expected values of 
Y15°** 5 Yps2 in the original population, these estimates 9.1 , Jp+2 are, of course, unbiased. 


However, if we calculate these 9’s for each individual of a sample of NV, and substitute them 
in the Pearson product-moment correlation formula, the estimate of the correlation be- 
tween Yp.1 and yp,2 thus obtained is somewhat biased. The bias depends on the number of 


observable y’s, and on the size of the variances and covariances of Ypi1 , Ypi2 relative to o’. 


Is Sampling Theory Applicable to Economic Time Series? T. J. Koopmans, 
Penn Mutual Life Insurance Company. 


The classical regression theory assumes that the values of the independent variables 
remain the same in repeated samples. Certain situations in economic analysis, like price 
formation according to the ‘‘cobweb”’ theorem, require a sampling theory of serial regression 
in which certain observations may represent a dependent variable at one time and an inde- 
pendent variable at a later time. This leads to the problem of the joint distribution of 
certain quadratic forms in normal variables. 

The simplest problem of this type is that of the distribution of the ratio r = q/p of a 
quadratic form g in 7 observations from a normal distribution with mean 0 to the sum p 
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of the squares of these observations. The distribution of r is independent of that of p 
and is 





i” _ — r)iT—2 
Stil an $T f 7 r) , de, 
ie , 


2Qri ‘Hy ae to 


¢=] 


where the k; are the characteristic values of qg, while the path of integration y proceeds 
from r through the lower half of the complex plane to a point on the real axis exceeding any 
k, and from there returns to r through the upper half-plane. 

In testing for the presence or absence of serial correlation (or regression) qg is the sum of 
products of successive observations, and k; = o? cos {rt/(T + 1)}. Replacing this set of 
discrete values in the above integral by a continuous variable of similar distribution, the 
following approximation to the distribution of r is found: 


; 1=—2 ... fs . oe 7 r 1 
h*(r) = —— 227 | (sin @ — r)3T—2. sin (T rT - — @ )-cos! ¢ do 
~ are sins \4 - 































CONSTITUTION 
OF THE 
INSTITUTE OF MATHEMATICAL STATISTICS 


ARTICLE I 


NAME AND PURPOSE 


= h—6CrOULlOGe COU 


1. This organization shall be known as the Institute of Mathematical Statistics. 
2. Its object shall be to promote the interests of mathematical statistics. 


ARTICLE II 


MEMBERSHIP 


ee a a ee ee ee. | 


1. The membership of the Institute shall consist of Members, Fellows, Honorary 
Members, and Sustaining Members. 

2. Voting members of the Institute shall be (a) the Fellows, and (b) all others who 
have been members for twenty-three months prior to the date of voting. 


> ARTICLE III 


OrricERS, Boarp oF DrirREcToRS, COMMITTEE ON MEMBERSHIP, AND COMMITTEE ON 
PUBLICATIONS 


1. The Officers of the Institute shall be a President, two Vice-Presidents, and a Secre- 
tary-Treasurer, elected for a term of one year by a majority ballot at the annual meeting 
of the Institute. Voting may be in person or by mail. 

(a) Exception. The first group of Officers shall be elected by a majority vote of the 
individuals present at the organization meeting, and shall serve until December 31, 1936. 

2. The Board of Directors of the Institute shall consist of the Officers and the previous 
President. 

3. The Institute shall have a Committee on Membership composed of three Fellows. 
At their first meeting subsequent to the adoption of this Constitution, the Board of 
Directors shall elect three members as Fellows to serve as the Committee on Membership, 
one member of the Committee for a term of one year, another for a term of two years, 
and another for a term of three years. Thereafter the Board of Directors shall elect 
from among the Fellows one member annually at their first meeting after their election 
for a term of three years. The president shall designate one of the Vice-Presidents as 
Chairman of this Committee. 

4. The Institute shall have a Committee on Publications composed of three Members 
or Fellows elected by the Board of Directors. The President shall designate a Vice- 
President as Ex Officio Chairman of this Committee. 





ARTICLE IV 


MEETINGS 
1. A meeting for the presentation and discussion of papers, for the election of Officers, 
and for the transaction of other business of the Institute shall be held annually at such 
time as the Board of Directors may designate. Additional meetings may be called from 
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time to time by the Board of Directors and shall be called at any time by the President 
upon written request from ten Fellows. Notice of the time and place of meeting shall be 
given to the membership by the Secretary-Treasurer at least thirty days prior to the 
date set for the meeting. All meetings except executive sessions shall be open to the 
public. Only papers accepted by a Program Committee appointed by the President may 
be presented to the Institute. 

2. The Board of Directors shall hold a meeting immediately after their election and 
again immediately before the expiration of their term. Other meetings of the Board 
may be held from time to time at the call of the President or any two members of the 
Board. Notice of each meeting of the Board, other than the two regular meetings, 
together with a statement of the business to be brought before the meeting, must be 
given to the members of the Board by the Secretary-Treasurer at least five days prior to 
the date set therefor. Should other business be passed upon, any member of the Board 
shall have the right to reopen the question at the next meeting. 

3. The Committee on Membership shall hold a meeting immediately after the annual 
meeting of the Institute. Further meetings of the Committee may be held from time to 
time at the call of the Chairman or any member of the Committee provided notice of such 
call and the purpose of the meeting is given to the members of the Committee by the 
Secretary-Treasurer at least five days before the date set therefor. Should other business 
be passed upon, any member of the Committee shall have the right to reopen the ques- 
tion at the next meeting. 

4. At a regularly convened meeting of the Board of Directors, three members shall 
constitute a quorum. At a regularly convened meeting of the Committee on Member- 
ship, two members shall constitute a quorum. 


ARTICLE V 
PUBLICATIONS 
1. The Annals of Mathematical Statistics shall be the Official Journal for the Institute. 
Other publications may be originated by the Board of Directors as occasion arises. 
ARTICLE VI 
EXPULSION OR SUSPENSION 
1. Except for non-payment of dues, no one shall be expelled or suspended except by 
action of the Board of Directors with not more than one negative vote. 


ARTICLE VII 
AMENDMENTS 


1. This constitution may be amended by an affirmative two-thirds vote at any regu- 
larly convened meeting of the Institute provided notice of such proposed amendment 
shall have been sent to each voting member by the Secretary-Treasurer at least thirty 
days before the date of the meeting at which the proposal is to be acted upon. Voting 
may be in person or by mail. 
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ARTICLE I 
Duties OF THE OFFICERS, Boarp oF DrrecTors, COMMITTEE ON MEMBERSHIP, AND 
CoMMITTEE ON PUBLICATIONS 
1. The President, or in his absence, one of the Vice-Presidents, or in the absence of the 
President and both Vice-Presidents, a Fellow selected by vote of the Fellows present, 
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shall preside at the meetings of the Institute and of the Board of Directors. At meetings 
of the Institute, the presiding officer shall vote only in the case of a tie, but at meetings 
of the Board of Directors he may vote in all cases. At least three months before the date 
of the annual meeting, the President shall appoint a Nominating Committee of three 
members. It shall be the duty of the Nominating Committee to make nominations for 
Officers to be elected at the annual meeting and the Secretary-Treasurer shall notify al] 
voting members at least thirty days before the annual meeting. Additional noming- 
tions may be submitted in writing, if signed by at least ten Fellows of the Institute, up to 
the time of the meeting. 

2. The Secretary-Treasurer shall keep a full and accurate record of the proceedings 
at the meetings of the Institute and of the Board of Directors, send out calls for said 
meetings and, with the approval of the President and the Board, carry on the corre- 
spondence of the Institute. Subject to the direction of the Board, he shall have charge 
of the archives and other tangible and intangible property of the Institute. He shall 
send out calls for annual dues and acknowledge receipt of same; pay all bills approved 
by the President for expenditures authorized by the Board or the Institute; keep a 
detailed account of all receipts and expenditures, prepare a financial statement at the 
end of each year and present an abstract of the same at the annual meeting of the Insti- 
tute after it has been audited by a Member or Fellow of the Institute appointed by the 
President as Auditor. The Auditor shall report to the President. 

3. The Board of Directors shall have charge of the funds and of the affairs of the 
Institute, with the exception of those affairs specifically assigned to the President or to 
the Committee on Membership. The Board shall have authority to fill all vacancies 
ad interim, occurring among the Officers, Board of Directors, or in any of the Committees. 
The Board may appoint such other committees as may be required from time to time 
to carry on the affairs of the Institute. 

4. The Committee on Membership shall prepare and make available through the 
Secretary-Treasurer an announcement indicating the qualifications requisite for the 
different grades of membership. 

5. The Committee on Publications, under the general supervision of the Board of 
Directors, shall have charge of all matters connected with the publications of the Insti- 
tute, and of all books, pamphlets, manuscripts and other literary or scientific material 
collected by the Institute. Once a year this Committee shall cause to be printed in the 
Official Journal the Constitution and By-Laws and a classified list of all the Members 
and Fellows of the Institute. 


ARTICLE II 
DUES 


1. Members shall pay five dollars at the time of admission to membership and shall 
receive the full current volume of the Official Journal. Thereafter, Members shall pay 
five dollars annual dues. The annual dues of Fellows shall be five dollars. The annual 
dues of Sustaining Members shall be fifty dollars. Honorary Members shall be exempt 
from all dues. 

2. Annual dues shall be payable on the first day of January of each year. 

3. The annual dues of a Fellow or Member include a subscription to the Official 
Journal. The annual dues of a Sustaining Member include two subscriptions to the 
Official Journal. 

4. It shall be the duty of the Secretary-Treasurer to notify by mail anyone whose dues 
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may be six months in arrears, and to accompany such notice by a copy of this Article. 
If such person fail to pay such dues within three months from the date of mailing such 
notice, the Secretary-Treasurer shall report the delinquent one to the Board of Directors, 
by whom the person’s name may be stricken from the rolls and all privileges of member- 


ship withdrawn. Such person may, however, be re-instated by the Board of Directors 
upon payment of the arrears of dues. 


ARTICLE III 
SALARIES 


1. The Institute shall not pay a salary to any Officer, Director, or member of any 
committee. 


ARTICLE IV 


AMENDMENTS 


1. These By-Laws may be amended in the same manner as the Constitution or by a 
majority vote at any regularly convened meeting of the Institute, if the proposed amend- 
ment has been previously approved by the Board of Directors. 
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La. 
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Anderson, Theodore W., Jr. B.S. (Northwestern) Instr., Princeton Univ., Princeton, 
N.J. Graduate College. 
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